Role: Deep Learning Engineer | Machine Learning Engineer

Company: DQLabs https://www.intellectyx.com/ | https://www.dqlabs.ai/

Project: Augmented Data Quality Platform

Developed and implemented advanced machine learning and natural language processing techniques to enhance DQLabs.ai, an augmented data quality platform. The platform enables organizations to measure, monitor, remediate, and improve data quality across various types of data, leveraging ML and self-learning capabilities.

Technologies and Libraries:

Key Accomplishments:

  1. Developed a Deep Learning/Neural Network/NLP model to determine the semantic type of provided data.
  2. Built a system that identifies the semantic type (e.g., Name, SSN, Phone) for each column in a dataset.
  3. Integrated NLP and deep learning libraries and frameworks, including NLTK, Gensim, Keras, and PyTorch, into the Python codebase.
  4. Achieved an accuracy rate of 90% and above for each column type semantic detection.
  5. Developed the ability to build an API on top of the semantic type detection system to integrate with PySpark.

Throughout the project, I demonstrated strong expertise in natural language processing, deep learning, and big data technologies. By leveraging state-of-the-art algorithms like Sherlock and Sato, along with popular NLP and deep learning libraries, I successfully developed a highly accurate and scalable semantic type detection system. This system significantly enhances the data quality management capabilities of DQLabs.ai, empowering organizations to gain valuable insights and make data-driven decisions with confidence.