The primary focus will be on applying data mining techniques, performing advanced statistical analysis, building high-quality machine learning or deep learning models, and deploying them as production solutions integrated with existing IT digital solutions and products.
Responsibilities
Lead discovery processes with business stakeholders to identify opportunities and business problems, and frame them as IT data science or advanced data analytics project initiatives.
Identify and extract available, relevant data from internal and external data sources for data science solution development.
Perform data cleansing using data processing and statistical software packages.
Perform data quality assessments and statistical testing to verify data quality and data integrity.
Conduct exploratory data analysis to extract business insights from large volumes of data using data science toolkits and statistical packages.
Translate and visualize data into information and insights to discover trends and patterns for solving the business problem and improving the Key Performance Indicators (KPIs).
Develop custom data models, standard statistical models, and machine learning or deep learning algorithms to build diagnostic, predictive and prescriptive data science solutions.
Coordinate with different functional agile teams, such as data engineering and software development, to deploy the data science solutions to production and integrate them with existing IT digital solutions and products.
Ideal Profile
Master’s degree in Applied Mathematics, Statistics, Machine Learning, Electrical Engineering, Computer Science/Information Systems or a related field, or an MBA with a quantitative focus.
Industrial research and development and advanced data analytics experience.
Experience in mining large volumes of data, extracting insights, and building prediction and system optimization models.
Comprehensive statistical knowledge (regression/classification models, design of experiments, statistical testing, etc.) and experience applying it to real-world projects.
Strong knowledge of applied machine learning.
Experience with common data science toolkits, such as SQL, R and/or Python, with high proficiency in the use of open-source statistical and machine learning packages (numpy, pandas, scikit-learn, stats tools, etc.).
Experience delivering data science projects from model development to production deployment.
Any of the following qualifications is a plus
Experience with deep learning algorithms (TensorFlow or an equivalent framework), ideally demonstrated by relevant industrial experience.
Experience in deploying predictive or prescriptive data science solutions on embedded electronic sensor data streams (such as IoT sensors, PLC systems, condition monitoring systems, wearables, etc.).