Below are some of my research works that are publicly available:
Books
1. Choudhary, Narayan (ed.). 2019. Linguistic Resources for AI/NLP in Indian Languages. Central Institute of Indian Languages, Mysore. ISBN: 978-81-7343-295-8.
2. Cost Analysis of Linguistic Resources. 2018. Narayan Choudhary. CIIL, Mysore. ISBN: 978-81-7343-283-5
3. Proceedings of the Third Students’ Conference of Linguistics in India (SCONLI-3), 2011. ed. with Gibu Sabu M., Parimal Publishers, New Delhi. ISBN: 978-81-7110-383-6
4. Indian Language Part-of-Speech Tagset: Hindi, 2010. Co-authored with Kalika Bali, Monojit Choudhury, Priyanka Biswas, Girish Nath Jha, Maansi Sharma. Linguistic Data Consortium, Philadelphia. ISBN: 1-58563-571-5
For more details on books and book chapters, please check my institutional profile link at CIIL.
Papers
16. Choudhary, Narayan; Premkumar, LR; Singh, Chandan; Mondal, Shubhanan; Shivangi, Priya; Sudarshan, Beluru, Perumal Samy, Mohan, Shailendra. 2011. Bharatavani Project - Reviving Linguistic Diversity and Cultural Heritage in India: A Case Study, In Proceedings of the 2nd International Workshop on Digital Language Archives: LangArc 2023. University of North Texas Libraries, UNT Digital Library, https://digital.library.unt.edu; crediting UNT College of Information. doi: https://doi.org/10.12794/langarc2114300
15. Shivangi Priya, Pushp Lata, Narayan Choudhary. 2021. Patterns of Language Use and Language Preferences of Maithili Speakers in Bihar. In Mithila Bharati, issue No. VIII, 2021. pp. 84-97.
14. Choudhary, N. 2021. LDC-IL: The Indian Repository of Resources for Language Technology. Language Resources & Evaluation. Springer, Vol. 55, Issue 1. doi: https://doi.org/10.1007/s10579-020-09523-3
13. N. Choudhary and D. G. Rao. 2020. The LDC-IL Speech Corpora. In Proceedings of 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA), Yangon, Myanmar, 2020. pp. 28-32, doi: https://doi.org/10.1109/O-COCOSDA50338.2020.9295011
12. Nganthoibi Oinam, Diwakar Mishra, Pinal Patel, Narayan Choudhary, Hitesh Desai. 2018. A treebank for the healthcare domain. In Proceedings of the Joint Workshop on Linguistic Annotation, Multiword Expressions and Constructions (LAW-MWE-CxG-2018). pp. 144-155 ISBN: 978-1-948087-51-3
12. Nganthoibi Oinam, Diwakar Mishra, Pinal Patel, Narayan Choudhary, Hitesh Desai. 2018. A treebank for the healthcare domain. In Proceedings of the Joint Workshop on Linguistic Annotation, Multiword Expressions and Constructions (LAW-MWE-CxG-2018). pp. 144-155 ISBN: 978-1-948087-51-3
11. Parth Pathak, Pinal Patel, Vishal Panchal, Sagar Soni, Kinjal Dani, Narayan Choudhary, Amrish Patel. 2015. ezDI: A Supervised NLP System for Clinical Narrative Analysis. In: Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015). Denver, Colorado. The Association for Computational Linguistics. (ISBN: 978-1-941643-40-2) (This paper describes our approach for the task where we ranked 1st out of 21 participants globally)
10. Neha Dixit and Narayan Choudhary. 2014. Evaluating Two Annotated Corpora of Hindi Using a Verb Class Identifier. In Proceedings of Internationa Conference on Natural Language Processing (ICON 2014), Goa University, Goa. NLP Association of India (NLPAI).
9. Neha Dixit and Narayan Choudhary. 2014. Automatic Classification of Hindi Verbs in Syntactic Perspective. International Journal of Emerging Technology and Advanced Engineering, Volume 4, 8th Issue. ( ISSN 2250 – 2459 (Online))
8. Parth Pathak, Pinal Patel, Vishal Panchal, Narayan Choudhary, Amrish Patel, Gautam Joshi. 2014. ezDI: A Hybrid CRF and SVM based Model for Detecting and Encoding Disorder Mentions in Clinical Notes. In: Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014). Dublin, Ireland. ISBN 978-1-941643-29-7
7. Narayan Choudhary, Parth Pathak, Pinal Patel, Vishal Panchal. 2014. Annotating a Large Representative Corpus of Clinical Notes for Parts of Speech. In: Proceedings of 8th Linguistic Annotation Workshop, Dublin, Ireland. ISBN 978-1-941643-29-7
6. Narayan Choudhary, Girish Nath Jha and Pramod Pandey. 2011. A Rule based Method for the Identification of TAM features in a PoS Tagged Corpus. In: Proceedings of 5th Language Technology Conference, Fundacja Uniwersytety im. A. Mickiewicza, Poznan. Pp. 334-338 ISBN No.: 978-83-932640-1-8
5. Narayan Choudhary, Girish Nath Jha. 2011. Creating Multilingual Parallel Corpora in Indian Languages. In: Proceedings of 5th Language Technology Conference, Fundacja Uniwersytety im. A. Mickiewicza, Poznan. Pp. 85-89 ISBN No.: 978-83-932640-1-8 (This paper was awarded the Best Student Paper Award at the conference)
4. Narayan Choudhary. 2010. Web-drawn corpus for Indian Languages: A Case of Hindi. 2011. In: Proceedings of Information Systems for Indian Languages. Volume 139, Part 2, 218-223. Springer Verlag. ISBN No.: 978-3-642-19403-0_36
3. Narayan Choudhary. 2008. बोधात्मक भाषाविज्ञान. In: Gaveshana, April-June, 2008 vol.:90/2008 Central Institute of Hindi, Agra. 2008. pp.:11-18 (This is a translation of the article “Cognitive Linguistics” from Encyclopedia of Linguistics by Gilles Falkner, 2006)
2. Narayan Choudhary. 2006. Syllable Structure of Great Andamanese, 2006. In: Proceedings of National Seminar on Perspectives in Linguistics, Kashmir University, Srinagar, Kashmir. India 2007. Pp. 141-146
1. Narayan Choudhary, Anvita Abbi, Girish Nath Jha. 2007. Morphological Analyzer for Great Andamanese Verbs: Implementing a Concatenative Template. 2007. Co-authored by Anvita Abbi and Girish Nath Jha, in Vishwabharat ( April 2007 - January 2008 Journal) TDIL, New Delhi, pp.113-118 ISSN No.: 0972-6454
http://tdil.mit.gov.in/april-jan-2008/8.8_Morphological_analyzer.pdf
Dissertations
I submitted my PhD thesis titled Automatic Identification and Analysis of Verb Groups in Hindi in August, 2011 to Jawharlal Nehru University, New Delhi. The resultant tool can be viewed here. The dissertation itself can be downloaded here.
A look into my M.Phil. Dissertation titled Developing a Computational Framework for the Verb Morphology of Great Andamanese submitted to Jawaharlal Nehru University, New Delhi is available here.
Word Order in Pnar (Jaintia) is my MA dissertation submitted to Jawaharlal Nehru University, New Delhi on the lesser known language called Pnar (aka Jaintia) spoken by the Jaintia people of the state of Meghalay in India.