YutongWang - Machine Learning Pathway

Name: Yutong Wang
Team: ML Team 3 (Predictors)

Overview of Things Learned:

Technical Area: I researched data annotation tools available for both enterprise and personal use and wrote a report on one of them. For the machine learning part, I built a web crawler for https://ai.stackexchange.com/ using the Scrapy library to collect question titles, tags, individual question links, and question content.

Tools Used: Python, Pandas, JSON, Scrapy, Git, Colab, Visual Studio Code

Soft Skills: Communicating with the team leader, tech leader, and teammates to resolve questions and confusion.

Achievement Highlights

  1. Successfully scraped the ai.stackexchange.com website with Scrapy
  2. Researched and wrote a report on a data annotation tool called Scale

List of Meetings attended

  • All team meetings except the first one
  • Watched the recordings of the other webinars

Goals for the Upcoming Week

Build a model and train it on the collected data.

Tasks Done

  1. Task: Successfully scraped the ai.stackexchange.com website with Scrapy. Hurdles: I had trouble getting Scrapy to crawl at first, and the Stack Exchange website blocks an IP address once it has sent a high number of requests.
  2. Task: Researched and wrote a report on a data annotation tool called Scale. Hurdles: I was very new to the idea of data annotation tools.
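
One common way to reduce the risk of the IP block described in the first hurdle is to slow the crawler down. The sketch below is an assumption about how that could be configured, not a record of what was done in this project. The setting names are standard Scrapy settings, but the dictionary name POLITE_SETTINGS and all of the values are illustrative and would need tuning.

```python
# Hypothetical throttling configuration; the values are assumptions, not tested limits.
POLITE_SETTINGS = {
    "ROBOTSTXT_OBEY": True,                  # respect the site's robots.txt
    "DOWNLOAD_DELAY": 2.0,                   # base wait (seconds) between requests
    "CONCURRENT_REQUESTS_PER_DOMAIN": 1,     # one request at a time per site
    "AUTOTHROTTLE_ENABLED": True,            # adapt the delay to server latency
    "AUTOTHROTTLE_START_DELAY": 2.0,         # initial delay used by AutoThrottle
    "AUTOTHROTTLE_MAX_DELAY": 30.0,          # upper bound when the server is slow
    "AUTOTHROTTLE_TARGET_CONCURRENCY": 1.0,  # average parallel requests to aim for
    "RETRY_HTTP_CODES": [429, 500, 502, 503, 504],  # retry on rate-limit and server errors
    "RETRY_TIMES": 3,                        # give up after three retries
}
```

A dictionary like this can be assigned to a spider's custom_settings class attribute, or the same keys can be placed in the project's settings.py.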