In this module, we broke down how to parse all the papers we need from the database. I learned how to use the PubMed parser and how to leverage Dask to process the papers in parallel. I also learned how to use the Stanford parser to break the text down further into sentences and words.
Achievements: Processed a small subset of papers with the PubMed parser and parsed sentences with the Stanford parser.
Goals: Apply knowledge of Dask to process the entire dataset of PubMed papers.
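As a starting point for that goal, here is a minimal sketch of mapping a parser over many papers with a Dask bag. It is an illustration under assumptions, not the pipeline used in the module: `parse_paper`, `parse_all`, and `SAMPLE_PAPERS` are hypothetical names, the XML is a heavily simplified stand-in for real PubMed articles, and the stand-in parser uses the standard library rather than the PubMed parser itself.

```python
import xml.etree.ElementTree as ET

import dask.bag as db


def parse_paper(xml_text):
    """Hypothetical stand-in for the PubMed parser: pull the title and abstract from one article."""
    root = ET.fromstring(xml_text)
    return {
        "title": root.findtext(".//article-title", default=""),
        "abstract": root.findtext(".//abstract/p", default=""),
    }


def parse_all(xml_documents, npartitions=2):
    """Parse every document in parallel by mapping the parser over a Dask bag."""
    bag = db.from_sequence(xml_documents, npartitions=npartitions)
    # The threaded scheduler keeps this sketch simple to run; a full dataset
    # would more likely use processes or a distributed cluster.
    return bag.map(parse_paper).compute(scheduler="threads")


# Tiny made-up articles, only for demonstration.
SAMPLE_PAPERS = [
    "<article><front><article-title>Paper A</article-title>"
    "<abstract><p>First abstract.</p></abstract></front></article>",
    "<article><front><article-title>Paper B</article-title>"
    "<abstract><p>Second abstract.</p></abstract></front></article>",
    "<article><front><article-title>Paper C</article-title>"
    "<abstract><p>Third abstract.</p></abstract></front></article>",
]

if __name__ == "__main__":
    for record in parse_all(SAMPLE_PAPERS):
        print(record["title"], "|", record["abstract"])
```

For the full dataset, the same pattern should carry over by building the bag from file paths instead of in-memory strings and swapping the stand-in for the real parsing call, so that each partition reads and parses its own files.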