Shruti Rijhwani
- ✨Our TACL paper introduces a semi-supervised learning method for OCR post-correction (w/ Daisy Rosenblum, @gneubig, @anas_ant). We improve digitization accuracy on endangered languages by up to 29%! 📌Talk+poster at #EMNLP2021 on Nov 7th! 📌Paper: shrutirij.github.io/ocr-el/ 1/5
- There is *lots* of text in endangered languages that isn't machine-readable: paper books, handwritten notes, scanned images... Our #EMNLP2020 paper (w/ @gneubig, @anas_ant) addresses the task of extracting text from these sources. All the details: shrutirij.github.io/ocr-el/ 1/5
- Incredibly honored to be recognized on the 2022 @Forbes 30 under 30 list in Science! ✨*HUGE* thanks to my collaborators and mentors, @mulix, @anas_ant, @gneubig ✨More about my recent work: shrutirij.github.io/ocr-el/ @ForbesUnder30 #ForbesUnder30 forbes.com/30-under-30/20…
- Excited to share XTREME-UP, a benchmark dataset for NLP in under-represented languages! With the rapid development of language technology, it’s important that as many languages as possible benefit from these technologies, so we’re sharing XTREME-UP, a benchmark for evaluating multilingual models. 📝goo.gle/xtreme-up-paper 💻github.com/google-researc… Read 🧵↓ (1/3)
- I've seen multiple tweets today on temporal adaptation of NLP models, so I thought it might be a good time to promote our temporally-diverse dataset + analysis for NER! Paper: aclanthology.org/2020.acl-main.… Dataset: zenodo.org/record/3899040 (w/ @daniel_preotiuc and @TechAtBloomberg)
- 📢 I'm recruiting PhD students at MPI!! Topics include: 1⃣ LLM factuality, reliable info synthesis and reasoning, personalization + applications in real-world inc. education, science 2⃣ Data-centric interpretability 3⃣ Creativity in AI, esp scientific applications 🧵1/2
- I'm giving a talk on entity linking for low-resource languages this Friday! Sign up on eventbrite or message me for the Zoom link! On June 25th, at 17:00 UTC, SIGTYP will host a lecture by Shruti Rijhwani (@shrutirij) on "Cross-Lingual Entity Linking for Low-Resource Languages." Registration: eventbrite.com/e/159198706617 Chat: sigtyp.inf.ethz.ch/channel/lectur…
- Replying to @emnlpmeeting: Outstanding Area Chairs: Alla Rozovskaya, Gaël Guibon, Gerasimos Spanakis, Jianhui Pang, JinYeong Bak, Jivnesh Sandhan, Lei Li, Lucy Li, Mark G. Lee, Matthieu Labeau, Shruti Rijhwani, Vered Shwartz, Vivek Gupta, Wei Zhao, Xindi Wang
- The EMNLP 2024 call for papers is here! Submissions are through ARR, with a deadline of June 15 🚀✨ I'm also super excited to be on the organizing committee for the conference, as publicity co-chair! #EMNLP2024 EMNLP 2024 will take place in Miami, Florida from Nov 12th to Nov 16th, 2024, at the Hyatt Regency Miami Hotel. More information: 2024.emnlp.org #EMNLP2024
- Excited to be in Mexico City for #NAACL2024! 🎉 Say hi if you see me around, or send me a message if you would like to meet up!
- Excited to be a part of the organizing committee for EMNLP 2024! Looking forward to a great conference in Miami 🏖️✨ Super excited to work alongside this amazing group of people organizing this year's @emnlpmeeting. If you haven't done so, check out the website: 2024.emnlp.org Looking forward to seeing everyone in Miami later this year! More info soon. #NLProc #EMNLP2024
- The code for our paper "OCR Post Correction for Endangered Language Texts" is now available: github.com/neulab/ocr-pos… Try it out to train OCR post-correction models for low-resource settings! There is *lots* of text in endangered languages that isn't machine-readable: paper books, handwritten notes, scanned images... Our #EMNLP2020 paper (w/ @gneubig, @anas_ant) addresses the task of extracting text from these sources. All the details: shrutirij.github.io/ocr-el/ 1/5
- I'm presenting my work (w/ Jiateng Xie, @gneubig and Jaime Carbonell) on low-resource entity linking at #AAAI19. Drop by the talk tomorrow (1/30) at 11.30am! Paper: aaai.org/Papers/AAAI/20…