Update affiliation process #1069

Merged
kermitt2 merged 15 commits into master from update-affiliation
Dec 26, 2023

@kermitt2 kermitt2 commented Dec 18, 2023

This PR updates the usage of the affiliation-address model:

  • review of features
  • a bit of training data correction
  • update the models
  • using the clusteror to decode the labeled result (much simpler and cleaner)
  • optional PDF coordinates for affiliation structure in the resulting TEI XML
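
To illustrate the cluster-based decoding mentioned in the list above, here is a minimal sketch of the general technique: consecutive tokens carrying the same label are grouped into one field. This is not the project's actual clusteror code. The function name `cluster_labeled_tokens`, the `Cluster` class, the sample labels, and the convention that an `I-` prefix marks the first token of a field are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import Iterable, List, Tuple


@dataclass
class Cluster:
    """A run of consecutive tokens sharing one label (hypothetical helper)."""
    label: str
    tokens: List[str] = field(default_factory=list)

    @property
    def text(self) -> str:
        return " ".join(self.tokens)


def cluster_labeled_tokens(pairs: Iterable[Tuple[str, str]]) -> List[Cluster]:
    """Group (token, label) pairs into clusters of consecutive same-label tokens.

    Assumption: a label starting with "I-" marks the first token of a new field.
    """
    clusters: List[Cluster] = []
    for token, raw_label in pairs:
        begins = raw_label.startswith("I-")
        label = raw_label[2:] if begins else raw_label
        # Open a new cluster on an explicit start marker or on a label change.
        if begins or not clusters or clusters[-1].label != label:
            clusters.append(Cluster(label))
        clusters[-1].tokens.append(token)
    return clusters


# Hypothetical labeled output for one affiliation string.
labeled = [
    ("Department", "I-<department>"),
    ("of", "<department>"),
    ("Physics", "<department>"),
    (",", "I-<other>"),
    ("Example", "I-<institution>"),
    ("University", "<institution>"),
    (",", "I-<other>"),
    ("France", "I-<country>"),
]

clusters = cluster_labeled_tokens(labeled)
for c in clusters:
    print(c.label, "->", c.text)
```

Working on clusters rather than on individual tokens means the code that builds the output only has to map each cluster's label to an output element, which is likely why this style of decoding is described as simpler.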

For best results, use the Deep Learning model, which gives significantly more accurate recognition.

@kermitt2 kermitt2 requested a review from lfoppiano December 18, 2023 10:31
@kermitt2

Still to do:

  • update documentation for affiliation coordinates,
  • consider coordinates for sub-structures of the affiliation orgName and address elements (country, etc.)
  • tests
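
As a rough aid for the coordinate items above, the following sketch reads a `coords` attribute from an affiliation element in TEI XML. It assumes each bounding box is serialized as `page,x,y,width,height` and that multiple boxes are separated by `;`. That format, the sample XML, and the helper names `Box` and `parse_coords` are assumptions for illustration and are not taken from this PR.

```python
import xml.etree.ElementTree as ET
from typing import List, NamedTuple


class Box(NamedTuple):
    """One bounding box (assumed field order: page, x, y, width, height)."""
    page: int
    x: float
    y: float
    width: float
    height: float


def parse_coords(value: str) -> List[Box]:
    """Parse a ';'-separated list of 'page,x,y,width,height' boxes."""
    boxes: List[Box] = []
    for part in value.split(";"):
        part = part.strip()
        if not part:
            continue  # tolerate empty values and trailing separators
        page, x, y, w, h = part.split(",")
        boxes.append(Box(int(page), float(x), float(y), float(w), float(h)))
    return boxes


TEI_NS = {"tei": "http://www.tei-c.org/ns/1.0"}

# Hypothetical TEI fragment; the values are made up.
sample = """<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <affiliation coords="1,72.00,150.25,200.50,9.00;1,72.00,160.25,120.00,9.00">
    <orgName type="institution">Example University</orgName>
  </affiliation>
</TEI>"""

root = ET.fromstring(sample)
affiliation = root.find(".//tei:affiliation", TEI_NS)
boxes = parse_coords(affiliation.get("coords", ""))
for box in boxes:
    print(box)
```

The same parser would apply unchanged if coordinates were later attached to sub-structures such as `orgName` or address elements, since only the element lookup changes.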

@coveralls

Coverage Status

coverage: 39.845% (-0.1%) from 39.959%
when pulling c776e3c on update-affiliation
into 6bd974d on master.

@kermitt2 kermitt2 merged commit 2ca3f35 into master Dec 26, 2023
@lfoppiano lfoppiano deleted the update-affiliation branch March 21, 2026 20:57