Radix Analytics Pvt Ltd

Issues & Objectives

  • To develop ML based system for predictions for Online Test Preparation Systems:
    • Students are asked questions for there exam preparations.
    • They may face issue with solving questions.

    • Our aim is to build a ML and NLP based system that can accurately predict the topic/chapter of that question.

    • Detecting appropriate topics removes the need for manual tagging and enables faster and frequent uploads of new questions/tests.

Solution

  • Natural Language Processing (NLP):
    • Images: Converted to text using appropriate Optical Character Recognition for different subjects.
    • Text: Converted to vectors using Word2Vec

  • Algorithm(s): Deep Neural Network + Random Forest

  • Storage: AWS Cloud

  • Database: MariaDB (SQL)

  • Deployment Framework: Flask API + Gunicorn

Project information

Skills

Machine Learning

Client

EdTech

Domain

Natural Language Processing

Location

India

Challenges

  • The questions are primarily on 4 subjects: Physics, Biology, Mathematics and Chemistry.

  • Questions are available as text, but many of them contains images of text, figures, equations and chemical diagrams.

  • Converting equations and chemical diagrams to appropriate formats for ML processing.

Results

Test recall (KPI) for different Subjects are given as below:

  • Physics: 92%

  • Biology: 88%

  • Chemistry: 89%

  • Mathematics: 89%