Sion Kim - Machine Learning - Self Assessment

Week: Week I-II, July 20th - Aug 2

Overview of Things Learned:

  • Ananconda/Jupyter Notebook
  • Web Scraping
  • Data Pre-processing

Technical Area:

  • Determining a forum suitable for scraping
  • Web scraping data from a forum

Tools:

  • BeautifulSoup
  • Selenium

Soft Skills:

  • Asana
  • Slack
  • Collaboration and teamwork

Achievement Highlights

  • Took initiative to organize the CodeCademy Team
  • First time Web scraping!

Meetings attended

  • Weekly ML Team6 Meetings

Goals for the Upcoming Week

  • Finish Cleaning StackExchange CSV (Data pre-processing)
  • Await integrated agenda and final CSV file

Tasks Done

  • Deemed the CodeCademy forum a suitable data set to train the recommender system with BeautifulSoup
  • Scraped the Title, Categories, Tags, Replies, and Post Content for my sub-categories on CodeCademy into a csv file using
  • Complied all the Titles, Categories, Tags, Replies, and Post Content for all topics on Codecademy into a final csv file with my team