About Me
An inspired Technical Leader with 5+ years of broad expertise in Full Stack development using Python, Node, Angular & React. A data management and ETL processing specialist with a willingness to expand into large-scale data streaming and machine learning implementations.
Key Domains I’ve worked in:
- Healthcare Data Management (ETL)
- Taxation Data Management (ETL)
- Insurance Data Management (ETL)
- Elderly Care Data (ETL & ML models)
- Sales Data (ETL & ML models)
- Speech processing & Intelligent Voice Assisted systems
- Django- and Flask-based back-ends and APIs
- Front-end and hybrid mobile applications in Angular, Ionic, and React Native
What am I good at?
- Conducting research and implementing theoretical models as working application modules.
- Extraction, Transformation, Loading of data
- Efficient processing, aggregation and modification of data to extract meaningful information as per the requirements
- Data visualization and building custom dashboards using Python, Angular or a coupled framework like Plotly Dash.
- Planning the end-to-end architecture and stack required for a product or application.
Work Experience
Bharti Institute of Public Policy,
Indian School of Business
Principal Data Scientist
November 2020 - Present
(Full Time)
- Perform hands-on data analysis and modeling with huge data sets for the India Data Portal (IDP)
- Discover data sources, import them, clean them up, and make them “portal-ready”
- Statistical modeling, model fitting, machine learning, data mining on large datasets
- Research and work with Data Engineering and Architecture teams to implement new technologies that will facilitate better data integrity, reliability, and enrichment of the portal
- Run regular tests and statistical analyses, and draw conclusions on the accuracy of the data
- Dive deep into a wide range of data (agricultural, financial, etc.) to identify opportunities and recommend solutions.
- Assess the effectiveness and accuracy of new data and data gathering techniques.
- Perform ad hoc data mining, exploration, and statistical analyses on complex problem statements
- Write ETL pipelines to make data available for training and testing models both offline and in production.
- Work with geospatial and satellite data and link it to economic indicators
Netsmartz Infotech
Technical Lead, AI & ML
June 2019 - October 2020
(Full Time)
- Driving the vision & strategy for AI & ML at Netsmartz, leading a team of Machine Learning Engineers and Python Full Stack Developers
- Technical architect responsible for designing PoCs and applications based on:
- Computer Vision
- Machine Learning
- Natural Language Understanding, Processing and Generation
- Predictive Analysis
- Recommendation Engines
Excise and Taxation Department
Data Mining Expert
November 2018 - May 2019
(Fixed Term Contract)
Responsible for handling GST-related data of U.T. Chandigarh
My job here included, but was not limited to, the following tasks:
- Build data pipelines for ETL (Extract, Transform & Load) operations on data received from the central GST portal
- Management of state-wise taxation data received from central GST Network Portal
- Separate pipelines to generate different kinds of reports (for each ward in the state) and the required data sets using complex Oracle SQL queries in conjunction with Python scripts
- Basic predictive analysis based on behavioral / transactional features of taxpayers
- Technologies used: Python, Pandas, Oracle SQL, SQLAlchemy, Scikit-Learn, Matplotlib
Sidekick EDGE Private Ltd
Technical Consultant
December 2016 - December 2017
(Part Time Consultancy)
(Role: Data Science, Front-end Development)
- Helped with database structuring and design
- Development of complete front-end in AngularJS for the company product/platform
- Managed a team of interns and developers responsible for front-end and back-end development
- Development of Mobile Application for the platform
- This platform is being used for multi-mode data collection in PGIMER and some other healthcare establishments
- Technologies used: AngularJS for the front-end, Loopback.io and Node.js for the back-end, a MySQL database, and Ionic for the mobile application
- Domains worked in: Frontend Development, Hybrid Mobile Application Development, Data Visualization, Data Analysis, Basic Predictions (Machine Learning)
School of Public Health, PGIMER
Data Manager
December 2016 - May 2018
(Fixed Term Contract)
In the research project entitled "Haryana Demonstration Project on Wheat Flour Fortification to improve Iron, Folate and Vitamin B12 status in Ambala District", funded by WHO, India and CDC, USA
- Designed and developed a web-based application for survey data entry (with error checks & skip patterns) in Python
- Developed the data entry manual, trained the data entry personnel and supervised the data entry sessions
- Optimization of mobile app (Ionic / AngularJS) linked with the web tracking system to record timestamps of sample collection and processing
- Performed dual data entry, checked data output, and ran data validations and checks to maintain accurate data on computer systems and in archives; took regular data backups
- Cleaned the compiled data sets after completion of data entry and handled the overall ETL (Extract, Transform, Load) process from two prime data sources: survey data and bio-sample data
- Performed data analysis (univariate, multivariate) and assisted the statistician in preparing final outcome tables
- Participated in visualizing the analysis outcomes and helped with writing up the interpretations and with the final report writing & editing process
- This baseline survey proved instrumental for the Haryana government in implementing the policy of providing wheat flour fortified with iron, folic acid, and vitamin B12 through the PDS
- Technologies used: Python (Django, Pandas, NumPy, scikit-learn), RStudio, JavaScript (AngularJS/Ionic), Tableau, SPSS
- Domains worked in: Mobile/Cloud based survey development, Data Management, ETL, Data Analysis (Python, R)
Ecologic Corporation
Senior Developer
April 2015 - November 2016
(Full Time)
- Design and implementation of moderately complex software solutions using multiple platforms (mobile and web).
- Design and manage complex database architectures (MySQL, PostgreSQL).
- Building data extraction (mining), transformation, and loading (ETL) processes from multi-dimensional databases / data stores, along with data exchange pipelines.
- Performed basic data analysis and data visualization, presenting insights using Python (matplotlib, plot.ly) and JavaScript (D3.js)
- Code documentation, configuration management and in-depth debugging
- A fair degree of manual testing of the technical scenarios
- User, Class, Network & System Architecture modelling as per technical scenarios
- Technologies used: PHP (CodeIgniter, Zend), JavaScript (AngularJS, Ionic, D3.js), Python (Django, NumPy, Pandas, Matplotlib, Scrapy), Node.js, Git, MySQL, PostgreSQL
- Domains worked in: Hybrid Mobile Application Development, Natural Language Processing, Web
Education
BBSB Engineering College
Master of Technology
in Information Technology
2011 - 2016
My master's thesis work was in the NLP domain and was carried out partially while I was working at Ecologic Corporation. This thesis was the turning point at which I started working in Python and slowly transitioned away from PHP.
Nalsar Proximate Education (Nalsar University of Law)
Post Graduate Diploma
in Cyber Laws
2013 - 2014
(Correspondence)
Institute of Engineering & Technology
Bachelor of Technology
in Information Technology
2008 - 2011
Thapar Polytechnic College
Diploma
in Computer Science
2005 - 2008