Machine Learning algorithms to recommend topics & cluster tags using Discourse as a case study

Machine Learning algorithms to (1) recommend topics and (2) cluster tags
using Discourse Forums as a case study.

  • This topic covers the overview of upcoming Machine Learning Projects. We are expecting to run 3 distinct projects:
    1. Collaborative filtering based topic suggestions
    2. Content based topic suggestions
    3. Content based tag clustering
  • This is a pilot project which will help evolve the Mentor Chains® platform. In addition to the display of technical and soft skills through the 1-Click® Resume, participants can elect to be showcased on our social media pages.
.

Project Overview:

All projects involve the analysis of Discourse Forums to decide input data and methods of collecting input data. The ML teams can interface with the team working on Discourse plugin development for help with data collection. The data mentioned below is a starting point.
  1. Collaborative Filtering based topic suggestions
    • Idea
      • Suggest posts based on user’s history
    • Data
      • Direct Measures: Likes
      • Indirect Measures: Views, Number of Replies
    • Algorithm
      • Collaborative Filtering process

  2. Content Based topic suggestions
    • Idea
      • Suggest posts based on textual semantic similarity
    • Data
      • Title, main post, replies (text)
      • Tags
      • Author
      • Date published
    • Algorithm
      • Embedding the text itself
        • TF-IDF, CountVec, Spacy
        • Deep Learning models - BERT, XLNET
      • Similarity between posts
        • Cosine similarity
        • KNN model
      • Bring other attributes into model
        • Same author
        • Set intersection of tags
        • Post closer in time

  3. Content Based tag clustering
    • Details to be added shortly

Skills:

  • Fundamental skills necessary for Machine Learning
  • Deep Learning models - BERT, XLNET
  • Similarity models - Cosine similarity, KNN model
  • Collaborative filtering
  • Communication
  • Teamwork
  • Leadership & Mentoring (project & task leads)

Dates:

Summer 2020

Prerequisites: