What I Learned:
*Technical Area:* Web scraping with Python, data cleaning, EDA, Git commands
*Tools:* Jupyter Notebook, GitHub, BeautifulSoup, pandas
*Soft Skills:* Teamwork, communication, time management, researching
- Completed my first web scrape following a provided resource
- Learned how to use Jupyter Notebook and other tools to perform data cleaning and EDA
- Performed my first git push to a collaborative repository
- Attended all team meetings
- Viewed ML Overview and Data Mining StemCasts
- Attended the Git webinar
Goals for upcoming week:
Research and complete text analysis, and coordinate with my group to give the rest of the team an update.
Detailed Statements of Tasks Done:
Drawing on my first web scrape and the resources my leads linked, I scraped a SmartThings post in Jupyter Notebook. I obtained the usernames and messages of everyone who replied to the post without much trouble.
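A minimal sketch of this kind of scrape, not the notebook's actual code. The HTML below is a hypothetical, simplified stand-in for a forum thread, and the class names `post`, `username`, and `message` are assumptions; the real SmartThings markup will differ and would normally be fetched over HTTP first.

```python
from bs4 import BeautifulSoup

# Hypothetical, simplified markup standing in for a forum thread.
SAMPLE_HTML = (
    '<div class="post"><span class="username">alice</span>'
    '<div class="message"><p>My hub keeps going offline.</p></div></div>'
    '<div class="post"><span class="username">bob</span>'
    '<div class="message"><p>Try a reboot first.</p></div></div>'
    '<div class="post"><span class="username">alice</span>'
    '<div class="message"><p>That worked, thanks!</p></div></div>'
)


def extract_replies(html):
    """Return one {"username", "message"} dict per reply found in the HTML."""
    soup = BeautifulSoup(html, "html.parser")
    replies = []
    for post in soup.find_all("div", class_="post"):
        user = post.find("span", class_="username")
        body = post.find("div", class_="message")
        if user is None or body is None:
            continue  # skip blocks that are not complete replies
        replies.append({
            "username": user.get_text(strip=True),
            "message": body.get_text(" ", strip=True),
        })
    return replies


replies = extract_replies(SAMPLE_HTML)
```

The resulting list of dicts can be passed straight to `pandas.DataFrame(replies)` for the later cleaning and EDA steps.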
Next, I cleaned the data from the first task by removing HTML, punctuation, and stop words while keeping URLs intact. I was able to accomplish this thanks to more resources provided by my team lead and by working with another teammate.
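The cleaning step can be sketched roughly as follows, using only the standard library. This is an illustration under assumptions, not the approach used in the notebook: the stop-word set is a tiny hypothetical sample (a library list such as NLTK's would be the usual choice), and a token counts as a URL only if it starts with `http://` or `https://`.

```python
import html
import re
import string

# Small hypothetical stop-word list, for illustration only.
STOP_WORDS = {"a", "an", "and", "at", "the", "is", "it", "to", "of", "in", "on", "for"}

URL_PATTERN = re.compile(r"https?://\S+")
TAG_PATTERN = re.compile(r"<[^>]+>")
PUNCT_TABLE = str.maketrans("", "", string.punctuation)


def clean_message(text):
    """Strip HTML tags, punctuation, and stop words; keep URLs as single tokens."""
    # Remove tags first, then decode entities such as &amp;
    text = html.unescape(TAG_PATTERN.sub(" ", text))
    tokens = []
    for token in text.split():
        if URL_PATTERN.match(token):
            # Keep the URL intact, dropping only trailing sentence punctuation.
            tokens.append(token.rstrip(".,;:!?"))
            continue
        word = token.translate(PUNCT_TABLE).lower()
        if word and word not in STOP_WORDS:
            tokens.append(word)
    return tokens


cleaned = clean_message(
    "<p>Check the guide at https://example.com/fix, great &amp; easy!</p>"
)
```

Handling URLs before the punctuation pass is what keeps them intact; stripping punctuation from the whole string first would break them apart.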
Finally, for EDA I created a bar graph of how often each person replied in the post from the previous tasks, as well as another graph of the most used words in the post. Once again, I was able to overcome any struggles thanks to resources provided by my leads.
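The counts behind both graphs could be computed along these lines. This sketch uses `collections.Counter` rather than pandas to stay dependency-free, the sample replies are made up, and the `plot_counts` helper is a hypothetical name that assumes matplotlib is installed.

```python
from collections import Counter

# Made-up sample data: one entry per reply, with already-cleaned tokens.
replies = [
    {"username": "alice", "tokens": ["hub", "offline"]},
    {"username": "bob", "tokens": ["reboot", "hub"]},
    {"username": "alice", "tokens": ["worked", "thanks"]},
]


def replies_per_user(replies):
    """Count how many replies each username made."""
    return Counter(r["username"] for r in replies)


def top_words(replies, n=10):
    """Return the n most frequent tokens as (word, count) pairs."""
    counts = Counter(tok for r in replies for tok in r["tokens"])
    return counts.most_common(n)


def plot_counts(pairs, title):
    """Draw a bar chart from (label, count) pairs; needs matplotlib."""
    import matplotlib.pyplot as plt  # imported lazily so counting works without it

    labels, values = zip(*pairs)
    fig, ax = plt.subplots()
    ax.bar(labels, values)
    ax.set_title(title)
    ax.set_ylabel("Count")
    fig.tight_layout()
    return fig


user_counts = replies_per_user(replies)
word_counts = top_words(replies, n=3)
```

In a notebook, `plot_counts(user_counts.most_common(), "Replies per user")` and `plot_counts(word_counts, "Most used words")` would draw the two charts.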
Additionally, I would like to upgrade from an observer to a participant.