General Electric Senior Data Scientist in Bengaluru, India
Job Description Summary
As a Senior Data Scientist, you will be part of a cross-disciplinary team within Quality and Services facing development projects, typically involving large, complex data sets (mostly structured & semi-structured). These teams typically include data engineers, data visualization experts, architects, program managers, product managers, and end users, working in concert with partners in GE business units. You will work in teams addressing statistical, machine learning and data understanding problems in a commercial technology and consultancy development environment. In this role, you will contribute to the development and deployment of modern machine learning, operational research, semantic analysis, and statistical methods for finding structure in large data sets.
GE Healthcare is a leading global medical technology and digital solutions innovator. Our mission is to improve lives in the moments that matter. Unlock your ambition, turn ideas into world-changing realities, and join an organization where every voice makes a difference, and every difference builds a healthier world.
In this role you will:
Potential application areas include predicting recall rates, failures, complaints, classifying complaints data into topics/sub topics, remote monitoring and diagnostics across infrastructure and industrial sectors and operations optimization.
Develop analytics to address customer needs and opportunities. Work alongside software developers and software engineers to translate algorithms into viable products and services.
Work in technical teams on the development, deployment, and application of applied, predictive, and prescriptive analytics.
Perform exploratory and targeted data analyses using descriptive statistics and other methods. Work with data engineers on data quality assessment, data cleansing and data analytics.
Generate reports, annotated code, and other project artifacts to document, archive, and communicate your work and outcomes.
Share and discuss findings with team members.
Develop, train, and deploy ML models (using multiple ML algorithms) with AWS SageMaker, Python, and AKS, and monitor and improve models in production.
Integrate domain data knowledge into development of data requirements.
Look across multiple systems, understand the purpose of each system, and define data requirements by system.
Communicate benefits, techniques, and approaches to key business stakeholders.
Lead other horizontal improvement initiatives, such as hackathons, that benefit the technology and advance work on a problem area.
Basic Qualifications:
Bachelor's Degree in Computer Science, Information Technology or equivalent (STEM)
A minimum of 6 years of similar experience working with databases, SQL, Python, data warehouses, Java, ETL, and the AWS cloud platform is required. AWS certifications would be an added advantage.
Experience with on-prem and cloud deployment processes using Kubernetes, Docker, and Jenkins
Ability to drive projects in big data (structured/unstructured/machine/logs/streaming data types)
Demonstrated skill in Python, AWS SageMaker, AKS, Kubernetes, SQL, Tableau, Power BI, and Teradata.
End-to-end experience in ML model development, training, re-training, deployment, monitoring, and improvement using regression, decision trees, random forests, forecasting (ARIMA, etc.), deep learning (CNNs, etc.), and linear/non-linear programming (any tool).
Good understanding of statistics (hypothesis testing, normality, linearity, probability, distributions, non-linear distributions, sampling, etc.).
Demonstrated skill at data cleansing, data quality assessment, and using analytics for data assessment
Demonstrated skill in the use of applied analytics, descriptive statistics, feature extraction and predictive analytics on industrial datasets
Demonstrated skill at data visualization and storytelling for an audience of stakeholders
Demonstrated awareness of data management methods
Demonstrated awareness of real-time analytics development and deployment
Experience with healthcare datasets and healthcare industry knowledge is an added advantage
Demonstrated awareness of critical thinking and problem solving methods
Demonstrated awareness of presentation and influencing skills
Delivers results when working on shorter-term (weeks-months), outcome-focused service engagements
Leverages knowledge about technology trends, and changing business needs across the broad environment to bring new ideas to the team
Articulates the value proposition of existing technology capabilities and maps them to customer requirements to minimize incremental cost of development
Experienced in working with on-prem (Teradata) and AWS databases, and able to work with large datasets.
Proficient with query and programming languages such as SQL and Python.
Experienced in one or more BI data visualization tools such as Spotfire, Tableau, or Power BI
Inclusion and Diversity
GE Healthcare is an Equal Opportunity Employer where inclusion matters. Employment decisions are made without regard to race, color, religion, national or ethnic origin, sex, sexual orientation, gender identity or expression, age, disability, protected veteran status or other characteristics protected by law.
We expect all employees to live and breathe our behaviors: to act with humility and build trust; lead with transparency; deliver with focus, and drive ownership – always with unyielding integrity.
Our total rewards are designed to unlock your ambition by giving you the boost and flexibility you need to turn your ideas into world-changing realities. Our salary and benefits are everything you’d expect from an organization with global strength and scale, and you’ll be surrounded by career opportunities in a culture that fosters care, collaboration and support.
Relocation Assistance Provided: Yes