Some things that I have learned are enhanced uses of python including iterating over large sets of data, using dictionaries, and pickle files. In addition, I learned about embedding, neural networks, RNNs, how words can be converted into vectors and uses for BERT. Through this process, I learned how to use VSCode, Github, Colab, and python libraries such as beautiful soup. Some soft skills that I learned were in communication and how it is important to keep messages clear and concise.
- I was able to download VSCode, download the libraries, and open up a json file that contained the questions that needed to be parsed
- I wrote an algorithm that went through thousands of submissions in a way that was more efficient than how it was orignially written
- I wrote code that was able to extract the text from a question on a discourse form and save it to a pickle file.
I attended the meetings every Monday and Friday to report any issues that I was having as well as learn about new concepts. I watched the recorded video from the industry leaders about Github.
Goals for upcoming week
I am working on creating a recommendation system. Each member of the group is able to put their own creative twist to this recommendation system. I am building a recommendation system that will output 5 questions that are most similar to the question that was asked.
- Joined Slack and GitHub for Stem-Away ML Team 5.
- Made Colab notebooks on Google account for scrapping.
- I chose the discourse forum that I would like to implement (breadtopia)
- Wrote python code to create URLs with submissions and extracted the individual submissions into a dictionary. The main outline was written by team leads, but I wrote the function to parse through the submissions.
- Wrote Python code to extract text from each submission. There was an issue, but I resolved it myself by implementing a sleep function, which got rid of the error I was originally receiving. The main outline was written by team leads.