1. Overcoming the bottleneck of extracting and indexing hundreds of millions of academic papers to support a scholarly big data service: a case study of CiteSeerX Open Access Author: Keesara, Sai Raghav Title: Overcoming the bottleneck of extracting and indexing hundreds of millions of academic papers to support a scholarly big data service: a case study of CiteSeerX Graduate Program: Computer Science and Engineering Keywords: Information RetrievalInformation ExtractionDigital LibrariesSearch EngineScalabilityAcademic LibrariesElasticsearch File: Download Keesara_Thesis_May2021.pdf Committee Members: C Lee Giles, Thesis Advisor/Co-AdvisorBhuvan Urgaonkar, Committee MemberJian Wu, Special SignatoryChitaranjan Das, Program Head/Chair