add citations training data from crossref unstructured references #864
miku wants to merge 1 commit into grobidOrg:master
|
Thanks a lot @miku ! |
|
Thanks for the review, and please let me know if there's a way to make them less tough. The set is basically a random shuffle of citation strings from Crossref. |
|
@kermitt2 - would it be better if I prepared another batch? |
|
I think you could not see my review (made 21 days ago :( ) |
| <tei xmlns="http://www.tei-c.org/ns/1.0" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML"> | ||
| <listBibl> | ||
| <bibl><author>Jasanoff, S.</author> (<date>2005</date>): <title level="m">States of Knowledge. The Co-production of Science and Social Order</title> -<pubPlace>London</pubPlace>: <publisher>Routledge</publisher>.</bibl> | ||
| <bibl><author>Bojko Krzysztof, Magdalena Góra</author>, <title level="m">Wybrane aspekty polityki Izraela, Stanów Zjednoczonych i Unii Europejskiej wobec Palestyńskiej Władzy Narodowej</title>, 2000-2007, <publisher>Księgarnia Akademicka</publisher>, <pubPlace>Kraków</pubPlace> <date>2007</date>.</bibl> |
I think ", 2000-2007" is part of the title.
| <bibl><author>Jasanoff, S.</author> (<date>2005</date>): <title level="m">States of Knowledge. The Co-production of Science and Social Order</title> -<pubPlace>London</pubPlace>: <publisher>Routledge</publisher>.</bibl> | ||
| <bibl><author>Bojko Krzysztof, Magdalena Góra</author>, <title level="m">Wybrane aspekty polityki Izraela, Stanów Zjednoczonych i Unii Europejskiej wobec Palestyńskiej Władzy Narodowej</title>, 2000-2007, <publisher>Księgarnia Akademicka</publisher>, <pubPlace>Kraków</pubPlace> <date>2007</date>.</bibl> | ||
| <bibl><author>Wickel</author>: <title level="a">Über stationäre Paralyse</title>. <title level="j">Allg. Z. Psychiatr.</title> <biblScope unit="volume">71</biblScope>, <biblScope unit="issue">360</biblScope> (<date>1914</date>).</bibl> | ||
| <bibl><author>Heinzel, C.</author>: <title level="m">Methoden zur Untersuchung und Optimierung der Kühlschmierung beim Schleifen</title>. <note type="report">Dissertation</note>, <publisher>University of Bremen</publisher>, <pubPlace>Bremen</pubPlace> (<date>1999</date>)</bibl> |
Use <orgName> for the institution of theses or technical reports.
This applies to dissertations too.
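The rule above could be checked mechanically across a batch. The following is only a sketch, not part of Grobid: the function name `thesis_publisher_issues` is hypothetical, and it assumes each `<bibl>` is parsed on its own as a fragment without the TEI namespace and that theses and reports carry a `<note type="report">` as in the entries of this PR.

```python
import xml.etree.ElementTree as ET


def thesis_publisher_issues(bibl_xml: str) -> list[str]:
    """Return <publisher> strings found in a thesis/report entry.

    Hypothetical lint rule: if the entry has <note type="report">,
    any <publisher> is reported as a candidate for <orgName>.
    """
    root = ET.fromstring(bibl_xml)
    has_report_note = any(n.get("type") == "report" for n in root.iter("note"))
    if not has_report_note:
        return []
    return [p.text or "" for p in root.iter("publisher")]


# Entry from this PR, annotated with <publisher> for the institution.
example = (
    '<bibl><author>Heinzel, C.</author>: <title level="m">Methoden zur '
    'Untersuchung und Optimierung der Kühlschmierung beim Schleifen</title>. '
    '<note type="report">Dissertation</note>, '
    '<publisher>University of Bremen</publisher>, '
    '<pubPlace>Bremen</pubPlace> (<date>1999</date>)</bibl>'
)

# Same entry with the institution annotated as <orgName>.
fixed = example.replace("publisher>", "orgName>")

print(thesis_publisher_issues(example))
print(thesis_publisher_issues(fixed))
```

A flagged entry still needs a human look, since a thesis can also be republished by a commercial publisher.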
| <bibl><author>Bojko Krzysztof, Magdalena Góra</author>, <title level="m">Wybrane aspekty polityki Izraela, Stanów Zjednoczonych i Unii Europejskiej wobec Palestyńskiej Władzy Narodowej</title>, 2000-2007, <publisher>Księgarnia Akademicka</publisher>, <pubPlace>Kraków</pubPlace> <date>2007</date>.</bibl> | ||
| <bibl><author>Wickel</author>: <title level="a">Über stationäre Paralyse</title>. <title level="j">Allg. Z. Psychiatr.</title> <biblScope unit="volume">71</biblScope>, <biblScope unit="issue">360</biblScope> (<date>1914</date>).</bibl> | ||
| <bibl><author>Heinzel, C.</author>: <title level="m">Methoden zur Untersuchung und Optimierung der Kühlschmierung beim Schleifen</title>. <note type="report">Dissertation</note>, <publisher>University of Bremen</publisher>, <pubPlace>Bremen</pubPlace> (<date>1999</date>)</bibl> | ||
| <bibl><author>Benguigui Y.</author> <title level="m">Infecções Respiratórias Agudas: Fundamentos Técnicos das Estratégias de Controle</title>. Série HCT / AIEPI -8.P. <pubPlace>Washington, DC</pubPlace>, OPS; c<date>1997</date>.</bibl> |
<title level="s">Série HCT / AIEPI</title> -<biblScope unit="volume">8</biblScope>.P.
After some checking on Google, 8 is definitely the volume, but I could not work out what the P. stands for.
| <bibl><author>Birkmann, J., Bach, C., Guhl, S., Witting, M., Welle, T. and Schmude, M.</author> (<date>2010</date>) '<title level="m">State of the Art der Forschung zu kritischen Infrastrukturen am Beispiel Strom/Stromausfall</title>', <title level="s">Schriftenreihe Sicherheit, Forschungsforum Öffentliche Sicherheit der FU Berlin</title>.</bibl> | ||
| <bibl>/// <author>Szacki J.</author> <date>2002</date>. <title level="m">Historia myśli socjologicznej</title>, <publisher>Wydawnictwo Naukowe PWN</publisher>.</bibl> | ||
| <bibl><author>C. Seaman, & V. Basili</author>, \"<title level="a">An empirical study of communication in code inspections</title>\", in <title level="m">Proceedings of the 19th International Confer- ence on Software Engineering</title>, <date>1997</date>, pp. <biblScope type="page">96-106</biblScope>.</bibl> | ||
| <bibl><author>Silins, H. and Mulford, W.</author> (<date>2001</date>). <title level="a">Reframing Schools: The Case for System, Teacher and Student Learning</title>. <note>Paper presented at the Australian Association for Research in Education (AARE)</note>, Fremantle, December.</bibl> |
<bibl><author>Silins, H. and Mulford, W.</author> (<date>2001</date>). <title level="a">Reframing Schools: The Case for System, Teacher and Student Learning</title>. Paper presented at the <title level="m">Australian Association for Research in Education (AARE)</title>, <pubPlace>Fremantle</pubPlace>, <date>December</date>.</bibl>
This is apparently a conference event.
| <bibl><author>C. Seaman, & V. Basili</author>, \"<title level="a">An empirical study of communication in code inspections</title>\", in <title level="m">Proceedings of the 19th International Confer- ence on Software Engineering</title>, <date>1997</date>, pp. <biblScope type="page">96-106</biblScope>.</bibl> | ||
| <bibl><author>Silins, H. and Mulford, W.</author> (<date>2001</date>). <title level="a">Reframing Schools: The Case for System, Teacher and Student Learning</title>. <note>Paper presented at the Australian Association for Research in Education (AARE)</note>, Fremantle, December.</bibl> | ||
| <bibl><author>V. F. Stolba</author>, « <title level="a">Graffiti and Dipinti</title> », p. <biblScope type="page">229</biblScope>, H 2, pl. 150, 156.</bibl> | ||
| <bibl><author>Ekingen, G.</author> <date>2004</date>. <title level="m">>A key to marine fishes of Turkey</title> <note>(in Turkish)</note>. <publisher>Mersin Üniversitesi Yayınları</publisher> No:12, Su Ürünleri Fakültesi Yayınları No:4, <pubPlace>Mersin</pubPlace>, 193 s.</bibl> |
tough one :)
<bibl><author>Ekingen, G.</author> <date>2004</date>. <title level="m">A key to marine fishes of Turkey</title> <note>(in Turkish)</note>. <title level="s">Mersin Üniversitesi Yayınları</title> No:<biblScope type="volume">12</biblScope>, <title level="s">Su Ürünleri Fakültesi Yayınları</title> No:<biblScope type="volume">4</biblScope>, <pubPlace>Mersin</pubPlace>, <biblScope type="page">193</biblScope> s.</bibl>
| <bibl><author>A. v. Griesheim, W. Koehs, E. Pflfiger</author>, <title level="m">Beitr~ge zur Physiologie der Zeugung</title>; namentlieh die 2. Abh <author>Pflfiger, E.</author>, <title level="a">Einige Beobach- tungen fiber die das Geschlecht bestimmenden Ursachen</title>. <title level="j">Pfiiiger's Archiv</title>, Bd. <biblScope unit="volume">XXVI</biblScope>, p. <biblScope type="page">237--258</biblScope>.</bibl> | ||
| <bibl><author>L. ESPOSITO and A. TUCCI</author>, in <title level="m">Proceedings of the Third European Ceramic Society Conference</title>, Madrid, 12?17 September1993, Vol. <biblScope unit="volume">3</biblScope>, edited by <editor>P. DURAN and J. F. FERNANDEZ</editor> (Faenza Editrice Iberica, Castellon de la Plana, <date>1993</date>) p. <biblScope type="page">301</biblScope>.</bibl> | ||
| <bibl><author>Junge, M.</author> (<date>2007</date>). <title level="m">Simulationsgestützte Entwicklung und Optimierung einer energieeffizienten Produktionssteuerung.</title> <note type="report">Dissertation</note>. <orgName>Universität Kassel</orgName>.</bibl> | ||
| <bibl><author>BOYNTON, R. S.</author> (<date>1999</date>) <title level="a">¿Quién necesita la filosofía? Entrevista a Martha Nussbaum.</title> <title level="j">The New York Times Magazine</title>, noviembre. <note>Traducción de Carme Castells: ¿Quién teme a Martha Nussbaum?</note> Lectora, 9/2003, <biblScope type="page">3-10</biblScope>.</bibl> |
<bibl><author>BOYNTON, R. S.</author> (<date>1999</date>) <title level="a">¿Quién necesita la filosofía? Entrevista a Martha Nussbaum.</title> <title level="j">The New York Times Magazine</title>, <date>noviembre</date>. <note>Traducción de Carme Castells: ¿Quién teme a Martha Nussbaum? Lectora, 9/2003</note>, <biblScope type="page">3-10</biblScope>.</bibl>
| <bibl><author>L. ESPOSITO and A. TUCCI</author>, in <title level="m">Proceedings of the Third European Ceramic Society Conference</title>, Madrid, 12?17 September1993, Vol. <biblScope unit="volume">3</biblScope>, edited by <editor>P. DURAN and J. F. FERNANDEZ</editor> (Faenza Editrice Iberica, Castellon de la Plana, <date>1993</date>) p. <biblScope type="page">301</biblScope>.</bibl> | ||
| <bibl><author>Junge, M.</author> (<date>2007</date>). <title level="m">Simulationsgestützte Entwicklung und Optimierung einer energieeffizienten Produktionssteuerung.</title> <note type="report">Dissertation</note>. <orgName>Universität Kassel</orgName>.</bibl> | ||
| <bibl><author>BOYNTON, R. S.</author> (<date>1999</date>) <title level="a">¿Quién necesita la filosofía? Entrevista a Martha Nussbaum.</title> <title level="j">The New York Times Magazine</title>, noviembre. <note>Traducción de Carme Castells: ¿Quién teme a Martha Nussbaum?</note> Lectora, 9/2003, <biblScope type="page">3-10</biblScope>.</bibl> | ||
| <bibl><author>Zoltman, Gerald and Burger, P h i l i p C.</author>, <title level="m">Marketing Research: Fundamentals and D y n a m i c s</title> , Hinsdale, Ill.: <publisher>The Dryden Press</publisher>, <date>1975</date>.</bibl> |
<pubPlace>Hinsdale, Ill</pubPlace>
Probably an OCR error for IL (Illinois) :)
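Because annotations like these are edited by hand, a quick well-formedness check can catch a tag that is opened but never closed. The sketch below uses only Python's standard library; the helper names `is_well_formed` and `raw_text` are hypothetical, and fragments are assumed to be parsed on their own, without the surrounding `<tei>` element. The second helper recovers the raw citation string so it can be compared with the original Crossref reference.

```python
import xml.etree.ElementTree as ET


def is_well_formed(fragment: str) -> bool:
    """True if the fragment parses as a single XML element."""
    try:
        ET.fromstring(fragment)
        return True
    except ET.ParseError:
        return False


def raw_text(fragment: str) -> str:
    """Concatenate all text nodes, i.e. the citation with tags stripped."""
    return "".join(ET.fromstring(fragment).itertext())


# A tag opened twice and never closed is rejected.
print(is_well_formed("<pubPlace>Hinsdale, Ill<pubPlace>"))
print(is_well_formed("<pubPlace>Hinsdale, Ill</pubPlace>"))

# Entry from this PR; stripping the tags should give back the raw string.
entry = (
    '<bibl><author>Keupp, H./Röhrle, B.</author>: '
    '<title level="m">Soziale Netzwerke</title>. '
    '<pubPlace>Frankfurt a. M.</pubPlace> <date>1987</date></bibl>'
)
print(raw_text(entry))
```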
| <bibl><author>BOYNTON, R. S.</author> (<date>1999</date>) <title level="a">¿Quién necesita la filosofía? Entrevista a Martha Nussbaum.</title> <title level="j">The New York Times Magazine</title>, noviembre. <note>Traducción de Carme Castells: ¿Quién teme a Martha Nussbaum?</note> Lectora, 9/2003, <biblScope type="page">3-10</biblScope>.</bibl> | ||
| <bibl><author>Zoltman, Gerald and Burger, P h i l i p C.</author>, <title level="m">Marketing Research: Fundamentals and D y n a m i c s</title> , Hinsdale, Ill.: <publisher>The Dryden Press</publisher>, <date>1975</date>.</bibl> | ||
| <bibl><author>Keupp, H./Röhrle, B.</author>: <title level="m">Soziale Netzwerke</title>. <pubPlace>Frankfurt a. M.</pubPlace> <date>1987</date></bibl> | ||
| <bibl><author>SILVA, A. C. da.</author> <title level="a">A desconstrução da Discriminação no Livro didático</title>. In: <author>Munanga, Kabengele</author>. <title level="m">Superando o Racismo na escola</title>. <pubPlace>Brasília</pubPlace>: <orgName>Ministério da Educação, Secretaria de Educação continuada, Alfabetização e Diversidade</orgName>, p. <biblScope type="page">21-37</biblScope>. <date>2005</date>.</bibl> |
In: <editor>Munanga, Kabengele</editor>.
(after a Google check)
| <bibl><author>SILVA, A. C. da.</author> <title level="a">A desconstrução da Discriminação no Livro didático</title>. In: <author>Munanga, Kabengele</author>. <title level="m">Superando o Racismo na escola</title>. <pubPlace>Brasília</pubPlace>: <orgName>Ministério da Educação, Secretaria de Educação continuada, Alfabetização e Diversidade</orgName>, p. <biblScope type="page">21-37</biblScope>. <date>2005</date>.</bibl> | ||
| <bibl><author>B�ssler R</author> (<date>1974</date>) <title level="a">Pathologische Anatomie der Gallenwegserkrankungen</title>. In: <editor>Becker V</editor> (Hrsg) <title level="m">Gastroenterologie und Stoffwechsel, Aktionen und Interaktionen</title>. <publisher>Witzstrock</publisher>, <pubPlace>Baden-Baden</pubPlace></bibl> | ||
| <bibl>Vgl. zu den Merkmalen der Informationsqualität <author>Weißenberger</author> (<date>1997</date>). S. <biblScope type="page">35</biblScope> sowie allgemein zu den Eigenschaften von Informationen <author>Wild</author> (<date>1982</date>), S. <biblScope type="page">124ff.</biblScope></bibl> | ||
| <bibl><author>Goldstein</author>: <title level="m">Handbuch der inneren Medizin</title> von <author>Mohr und Staehelin</author>, Bd. <biblScope unit="volume">5</biblScope>, 1. <date>1925</date>.</bibl> |
This one is not easy to understand. Is Goldstein an author? (He's not an author of the Handbuch.) What is the "1."?
Maybe we should drop this example?
| <bibl><author>Keupp, H./Röhrle, B.</author>: <title level="m">Soziale Netzwerke</title>. <pubPlace>Frankfurt a. M.</pubPlace> <date>1987</date></bibl> | ||
| <bibl><author>SILVA, A. C. da.</author> <title level="a">A desconstrução da Discriminação no Livro didático</title>. In: <author>Munanga, Kabengele</author>. <title level="m">Superando o Racismo na escola</title>. <pubPlace>Brasília</pubPlace>: <orgName>Ministério da Educação, Secretaria de Educação continuada, Alfabetização e Diversidade</orgName>, p. <biblScope type="page">21-37</biblScope>. <date>2005</date>.</bibl> | ||
| <bibl><author>B�ssler R</author> (<date>1974</date>) <title level="a">Pathologische Anatomie der Gallenwegserkrankungen</title>. In: <editor>Becker V</editor> (Hrsg) <title level="m">Gastroenterologie und Stoffwechsel, Aktionen und Interaktionen</title>. <publisher>Witzstrock</publisher>, <pubPlace>Baden-Baden</pubPlace></bibl> | ||
| <bibl>Vgl. zu den Merkmalen der Informationsqualität <author>Weißenberger</author> (<date>1997</date>). S. <biblScope type="page">35</biblScope> sowie allgemein zu den Eigenschaften von Informationen <author>Wild</author> (<date>1982</date>), S. <biblScope type="page">124ff.</biblScope></bibl> |
I think there are two references here:
<bibl>Vgl. zu den <title level="m">Merkmalen der Informationsqualität</title> <author>Weißenberger</author> (<date>1997</date>). S. <biblScope type="page">35</biblScope></bibl>
<bibl>sowie allgemein zu den <title level="m">Eigenschaften von Informationen</title> <author>Wild</author> (<date>1982</date>), S. <biblScope type="page">124ff</biblScope>.</bibl>
|
Hi @miku! I am slowly preparing a new Grobid release... Shall I update the annotations myself and merge the PR? |
Corrected ref data from PR #864
A follow-up on #854.