Tues 6/16 Assessment
Technical Area Learned: Web scraping in Python, using Python from the command line/terminal, and using Jupyter Notebook
Tools: BeautifulSoup package; also learned how to use the web developer tools in web browsers
Soft Skills: Communication, meeting deadlines, general team building
The first achievement I had was learning how web scraping works. The second achievement was getting the contents from web scraping into a pandas DataFrame. The third achievement was successfully working in a Jupyter Notebook, as I had never really used one before.
My goals are to update my DataFrame to include the contents of each post, as right now it only contains topics, tags, and URLs. I would like to do some data cleaning and maybe EDA if time allows, and then learn about NLP and apply it to the dataset.
The main task I did was web scraping. I first had to watch and read a lot to understand the general process of web scraping. There were many places to learn from, each with a different style, but I eventually settled on mainly using the Python requests package and Beautiful Soup. I was able to get the topic, URL, and tags from the website my team chose to scrape, but as I stated, I would like to try to get more early in the week. I then created a pandas DataFrame from those columns.
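For reference, the general process looks roughly like the sketch below. It is not the exact code from my notebook: the HTML snippet, the class names (topic, title, tag), and the helper names parse_topics and fetch_html are made up for illustration, since the real site's markup is different. It parses a small sample page with Beautiful Soup, pulls out the topic, URL, and tags, and builds a pandas DataFrame from them.

```python
from bs4 import BeautifulSoup
import pandas as pd

# Hypothetical markup standing in for a topic listing page.
# The real site's tags and class names will differ.
SAMPLE_HTML = """
<div class="topic">
  <a class="title" href="/t/first-post/1">First post</a>
  <span class="tag">python</span>
  <span class="tag">help</span>
</div>
<div class="topic">
  <a class="title" href="/t/second-post/2">Second post</a>
  <span class="tag">pandas</span>
</div>
"""


def parse_topics(html):
    """Return one dict per topic with its title, url, and list of tags."""
    soup = BeautifulSoup(html, "html.parser")
    rows = []
    for block in soup.select("div.topic"):
        link = block.select_one("a.title")
        if link is None:
            continue  # skip blocks that do not contain a topic link
        tags = [t.get_text(strip=True) for t in block.select("span.tag")]
        rows.append({
            "topic": link.get_text(strip=True),
            "url": link.get("href"),
            "tags": tags,
        })
    return rows


def fetch_html(url):
    """Download a page with requests (not called here; needs a real URL)."""
    import requests  # imported here so the parsing demo runs offline

    response = requests.get(url, timeout=10)
    response.raise_for_status()
    return response.text


# Build the DataFrame from the parsed rows.
df = pd.DataFrame(parse_topics(SAMPLE_HTML), columns=["topic", "url", "tags"])
print(df)
```

With a live site, the text returned by fetch_html would be passed to parse_topics in place of SAMPLE_HTML, with the selectors adjusted to match what the browser's developer tools show for that page.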
Meetings attended:
6/8 Weekly Meeting
6/15 Weekly Meeting
I also attended many preliminary meetings before teams were assigned