Chirag Ahuja

Email: cahuja1992@gmail.com

Data Scientist/AI Engineer

Data Analysis, Machine Learning, Deep Learning, Computer Vision

A data scientist experienced in Banking, eCommerce, Human Resource, LMS and Digital Marketing and passionate about Artificial Intelligence and its applications like Self-Driving Car, Text and Music Generation.

Competency Synopsis

Data Analytics & Artificial Intelligence

Machine Learning: Supervised, Unsupervised & Basics of Reinforcement Using (R, Scikit Learn, Matlab, Mllib, Mahout, H2O and Azure ML).

Deep Learning: Tensorflow, Theaono, Keras

Text Processing: NLTK, Gensim, StanFord NLP Suite, GATE

Image Processing: OpenCV, HPI, Scikit Image

Audio Processing: PyAudio, LibROSA

Data Warehousing/Architecture

Databases and Tools: MySQL, MS SQL Server, Postgresql, Hadoop (Hive, HAWQ, Drill),

NoSQL: HBase, HDFS, Cassandra, MongoDB, Couch base, Elastic search, and Neo4j.

Other Tools: Hive, Pig, MapReduce, Spark

Development

Languages: Java, Scala, Python, Bash Scripting, JavaScript

Frameworks: Spring Boot, Python Flask, Oozie, Play & Akka

DevOps: Docker, Vagrant, Github, Bitbucket

Cloud PlatForms

AWS: EC2, Lambda, Kenisis, RDS, Redshift, EMR, DynamoDB

Azure: EventHub, Data Lake, ML Studio

Professional Experience

Tatras Data Services Pvt Ltd Formerly (Algorithmic Insights Pvt Ltd) (April 2015 - Current)

Data Scientist (Client Facing Role)

Leading Data Science team from technical front and involved in setting up PoCs with clients. Develop highly competent Artificial Intelligent algorithms and complex modelling techniques.

Highlights:

Automatic Text Generation, Headline Generation, Text Summarization using RNN with Attentional Encoders.

Footfall prediction of retail stores using time series forecasting using Deep Learning Architectures.

Developing Digital Marketing Automation Product (LTV, Customer Segmentation) using Spark, Google AdWords, and Hadoop Stack.

Architected and Developed Near- Real-Time news article and tag recommender system for online news portals (Hindi and English).

Developed an adaptive learning system using Probabilistic Graphical Theory and Bayesian Belief Network (Gartner Listed).

Content based Song Recommender System using Music Information Retrieval Techniques.

Developed Scoring of resumes with respect to the job description.

Designed a customer analytics schema for reporting and predictive analytics, and also

delivering prescriptive and predictive analytics based on customer interaction on monthly basis for business.

Aureus Analytics Pvt Ltd (December 2014-March 2015)

Hadoop Consultant (Client Facing Role)

Served as a consultant for pilot project on Customer Genome using Hadoop Stack at Client Side (Axis Bank, Mumbai), while dealing with customer demographic, transaction (CC, DC, POS), Insurance, Loan, Investment Data.

Highlights:

Configured SQL database to store Hive metadata and build a Hadoop data warehouse while importing data from Oracle Exadata using Sqoop and Linux Bash Scripting.

Build a Customer Genome for Marketing Analytics while different life stages as Married, Student, Kids, Retired, Earning etc and finally analysing Next Best Action (NBA) by calculating different customer events like, CASA, CARD, Investment, Loan Events using Pig, Hive, HCatalog, Oozie, Linux Bash, Java and SAS.

Educational Experience

Education and Training

Udacity: Nanodegree Machine Learning Engineer, Self-Driving Car Engineer

School Of I.C.T, Gautam Buddha University: Integrated B.Tech (Electronics & Communication Engineering) + M.Tech (Intelligent Systems).

Data-Flair: Hadoop, Storm and Kafka Training.

Edureka: Business Analytics using R.

Thappar University, Patiala: Summer School of Data Analysis and Machine Learning by Google India, NVIDIA, Infosys Mohali.

Academic & Independent Project

Detect Lane Lines: Detect highway lane lines from a video stream. Use OpenCV image analysis techniques to identify lines, including Hough transforms and Canny edge detection.

Traffic Sign Classification: Implement and train a convolutional neural network to classify traffic signs. Use validation sets, pooling, and dropout to choose a network architecture and improve performance.

Behavioral Cloning: Architect and train a deep neural network to drive a car in a simulator. Collect your own training data and use it to clone your own driving behavior on a test track.

Vehicle Tracking: Track vehicles in camera images using image classifiers such as SVMs, decision trees, HOG, and DNNs. Apply filters to fuse position data.

Song Genre Classifier: Implemented and train a convolutional recurrent neural network to classify music genres.

Reinforcement Learning: Train a Smartcab to Drive

M.Tech Dissertation: “Detection of Explicit Images on Large Scale using MapReduce” with assistance Xebia IT Architects India Pvt Ltd.

Developed Small IoT Applications eg(Home Automation System) using AVR, Raspberry Pi and Redis.

Publications

“Pro Apache Spark” upcoming book on Spark and Its ecosystem with Apress Media.

“Detection of nude images on large scale using Hadoop” , Ahuja Chirag, Baghel Anurag Singh , Singh Gotam, Print IEEE (ISBN:978-9-3805-4415-1), Page(s): 849 – 853.

“Human Gesture Recognition System using Computer Vision” in IJETAE (ISSN 2250­2459), Volume 4, Special Issue 1, February 2014, Page(s): 849- 853.

Part-time

1.Data Science Consultant for start-ups to bootstrap Artificial Intelligence and Data Science.

2.Freelance Big Data Hadoop and Spark Trainer.

3.Delivered Hadoop Training at National Institute of Technology, Bhopal on November

1­5, 2014. Short Term Training Program at NIT, Bhopal (MANIT) on “Hands on Big Data using Hadoop”

4.Delivered Java Training at WizIQ from January 2012-June 2012.