Week: Week I-II, July 20 - Aug 2
Overview of Things Learned:
- Anaconda/Jupyter Notebook
- Web Scraping
- Data Pre-processing
Technical Area:
- Determining a forum suitable for scraping
- Web scraping data from a forum
Tools:
- BeautifulSoup
- Selenium
Soft Skills:
- Asana
- Slack
- Collaboration and teamwork
Achievement Highlights
- Took the initiative to organize the Codecademy team
- First time Web scraping!
Meetings attended
- Weekly ML Team6 Meetings
Goals for the Upcoming Week
- Finish cleaning the StackExchange CSV (data pre-processing)
- Await integrated agenda and final CSV file
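A minimal sketch of what the CSV cleaning step might look like, using only the Python standard library. The column name "Post Content", the sample rows, and the helper names clean_text and clean_rows are assumptions for illustration; the real StackExchange CSV may use different columns and need different rules.

```python
import csv
import html
import io
import re

TAG_RE = re.compile(r"<[^>]+>")   # crude HTML tag matcher, fine for simple markup
SPACE_RE = re.compile(r"\s+")     # any run of whitespace


def clean_text(raw):
    """Remove HTML tags and entities, then collapse whitespace."""
    no_tags = TAG_RE.sub(" ", raw)
    unescaped = html.unescape(no_tags)
    return SPACE_RE.sub(" ", unescaped).strip()


def clean_rows(rows, text_column="Post Content"):
    """Clean the text column, dropping empty rows and exact duplicates."""
    seen = set()
    cleaned = []
    for row in rows:
        text = clean_text(row.get(text_column) or "")
        if not text or text in seen:
            continue
        seen.add(text)
        cleaned.append({**row, text_column: text})
    return cleaned


# Hypothetical sample standing in for the real CSV file.
SAMPLE_CSV = (
    "Title,Post Content\n"
    'A,"<p>Hello &amp; welcome</p>"\n'
    'B,"  "\n'
    'C,"Hello   &amp; welcome"\n'
)
rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
cleaned = clean_rows(rows)
```

In this sample, row B is dropped because its content is blank, and row C is dropped because it matches row A once both are cleaned.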
Tasks Done
- Deemed the Codecademy forum a suitable data set for training the recommender system, and suitable for scraping with BeautifulSoup
- Scraped the Title, Categories, Tags, Replies, and Post Content for my sub-categories on Codecademy into a CSV file using
- Compiled all the Titles, Categories, Tags, Replies, and Post Content for all topics on Codecademy into a final CSV file with my team
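The scraping task above could be sketched roughly as follows: parse a topic page with BeautifulSoup, pull out the five fields, and write them as a CSV row. The HTML snippet and its class names (title, category, tag, replies, post) are hypothetical placeholders rather than the forum's real markup, and the helper names parse_topic and rows_to_csv are made up for this example. It parses an inline string instead of fetching a live page.

```python
import csv
import io

from bs4 import BeautifulSoup

# Hypothetical markup: these class names are placeholders, not the forum's real HTML.
SAMPLE_HTML = """
<div class="topic">
  <h1 class="title">How do I loop over a list?</h1>
  <span class="category">Python</span>
  <a class="tag">loops</a>
  <a class="tag">lists</a>
  <span class="replies">3</span>
  <div class="post">I am stuck on the for loop exercise.</div>
</div>
"""

FIELDS = ["Title", "Categories", "Tags", "Replies", "Post Content"]


def parse_topic(page_html):
    """Extract one row of the five fields from a topic page."""
    soup = BeautifulSoup(page_html, "html.parser")
    return {
        "Title": soup.select_one(".title").get_text(strip=True),
        "Categories": soup.select_one(".category").get_text(strip=True),
        # Join multiple tags into one cell so each topic stays on one row.
        "Tags": ";".join(t.get_text(strip=True) for t in soup.select(".tag")),
        "Replies": int(soup.select_one(".replies").get_text(strip=True)),
        "Post Content": soup.select_one(".post").get_text(" ", strip=True),
    }


def rows_to_csv(rows):
    """Serialize rows to CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


row = parse_topic(SAMPLE_HTML)
csv_text = rows_to_csv([row])
```

Because every teammate's file would share the same FIELDS header, combining the per-sub-category files into one final CSV amounts to passing all the rows to rows_to_csv together.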