Job details
Job type full-time
Full job description
At lilly, we unite caring with discovery to make life better for people around the worldWe are a global healthcare leader headquartered in indianapolis, indianaOur 35,000 employees around the world work to discover and bring life-changing medicines to those who need them, improve the understanding and management of disease, and give back to our communities through philanthropy and volunteerismWe give our best effort to our work, and we put people firstWe’re looking for people who are determined to make life better for people around the worldEli lilly services india pvt ltd business insights & analytics team: data engineer the purpose of the lcci business insights & analytics team is to partner with us business insights & analytics in providing high quality analytical support to brand, market research and other internal partners through right data, smart analytics and actionable insightsOur team will be responsible for setting up the data warehouses necessary to handle large volumes of data, create meaningful analyses, and deliver recommendations to leadershipAs part of the lcci team, we are excited to offer the role of data engineer who will be an integral part of the data governance and analytics team in 2022 and beyondCore responsibilities
Create and maintain optimal data pipeline architecture etl/ elt into structured data
Assemble large, complex data sets that meet functional / non-functional business requirements and create and maintain multi-dimensional modelling like star schema and snowflake schema, normalization, de-normalization, joining of datasets.
Expert level experience creating fact tables, dimensional tables and ingest datasets into cloud based toolsJob scheduling, automation experience is must.
Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.
Setup and maintain data ingestion, streaming, scheduling and job monitoring automationConnectivity between lambda, glue, s3, redshift, power bi needs to be maintained for uninterrupted automation.
Build the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources using sql and “big data” technologies like aws and google
Build analytics tools that utilize the data pipeline to provide actionable insight into customer acquisition, operational efficiency and other key business performance metrics
Work with stakeholders to assist with data-related technical issues and support their data infrastructure needs
Create data tools for analytics and data scientist team members that assist them in building and optimizing our product into an innovative industry leader
Required
5-7 years of in-depth hands-on experience in data warehousing redshift or any olap to support business/data analytics, business intelligence (bi)
Advanced working sql knowledge and experience working with relational databases, query authoring (sql) as well as working familiarity with a variety of databases and cloud data warehouse like redshift
Data model development, additional dims and facts creation and creating views and procedures, enable programmability to facilitate automation
Prior data modelling, olap cube modelling in sql server, ssas and power bi experience
Experience with redshift and olap systems is mustGlue pipeline skill is must
Data compression into parquet to improve processing and finetuning sql programming skills required
Experience building and optimizing “big data” data pipelines, architectures and data sets
Experience performing root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement
Strong analytic skills related to working with structured and unstructured datasets
Experience with manipulating, processing and extracting value from large disconnected unrelated datasets
Working knowledge of message queuing, stream processing, and highly scalable “big data” stores
Experience supporting and working with cross-functional teams and global it
Familiarity of working in an agile based working models
Preferred qualifications/expertise
Experience with relational sql and nosql databases, including aws redshift, postgres and cassandra
Experience with aws cloud services preferable: s3, ec2, emr, rds, sage maker
Experience with stream-processing systems: storm, spark-streaming, etc.
Experience with object-oriented/object function scripting languages: python, java, r, c++ scala, etc.
Would it be helpful to include certifications as an added bonus here? i’ve found this is something it teams invest a lot of their time on to make sure they are up to date and is an indicator that they know the tech well?
Education:
Bachelor’s or master’s degree on technology and computer science background
Eli lilly and company, lilly usa, llc and our wholly owned subsidiaries (collectively “lilly”) are committed to help individuals with disabilities to participate in the workforce and ensure equal opportunity to compete for jobsIf you require an accommodation to submit a resume for positions at lilly, please email lilly human resources ( lilly_recruiting_compliance@lists.lilly.com ) for further assistancePlease note this email address is intended for use only to request an accommodation as part of the application processAny other correspondence will not receive a responseLilly does not discriminate on the basis of age, race, color, religion, gender, sexual orientation, gender identity, gender expression, national origin, protected veteran status, disability or any other legally protected status.
Hiring insights
Job activity
Posted 30+ days ago