Back

Officer - Data Engineer - C11 - Hybrid - CHENNAI @ Citi

Worldwide Salaried Open

This is a data engineer position - a programmer responsible for the design, development implementation and maintenance of data flow channels and data processing systems that support the collection, storage, batch and real-time processing, and analysis of information in a scalable, repeatable, and secure manner in coordination with the Data & Analytics team.The overall objective is defining optimal solutions to data collection, processing, and warehousing. Must be a Spark Java development expertise in big data processing, Python and Apache spark particularly within banking & finance domain. He/She designs, codes and tests data systems and works on implementing those into the internal infrastructure.Responsibilities: Ensuring high quality software development, with complete documentation and traceabilityDevelop and optimize scalable Spark Java-based data pipelines for processing and analyzing large scale financial dataDesign and implement distributed computing solutions for risk modeling, pricing and regulatory complianceEnsure efficient data storage and retrieval using Big DataImplement best practices for spark performance tuning including partition, caching and memory managementMaintain high code quality through testing, CI/CD pipelines and version control (Git, Jenkins)Work on batch processing frameworks for Market risk analyticsPromoting unit/functional testing and code inspection processesWork with business stakeholders and Business Analysts to understand the requirementsWork with other data scientists to understand and interpret complex datasetsQualifications:5- 8 Years of experience in working in data eco systems.4-5 years of hands-on experience in Hadoop, Scala, Java, Spark, Hive, Kafka, Impala, Unix Scripting and other Big data frameworks.3+ years of experience with relational SQL and NoSQL databases: Oracle, MongoDB, HBaseStrong proficiency in Python and Spark Java with knowledge of core spark concepts (RDDs, Dataframes, Spark Streaming, etc) and Scala and SQLData Integration, Migration & Large Scale ETL experience (Common ETL platforms such as PySpark/DataStage/AbInitio etc.) - ETL design & build, handling, reconciliation and normalizationData Modeling experience (OLAP, OLTP, Logical/Physical Modeling, Normalization, knowledge on performance… Apply To This Job

More jobs

Workplace Strategy and Data Lead / Lead Business Execution Consultant @ Wells Fargo

Worldwide Salaried

Clinical Appeals RN – IG Therapy / Biologics – Remote

Worldwide Salaried

Content Marketing Specialist, Database Tools

Worldwide Salaried

Operations Manager, VBCM- Remote

Worldwide Salaried

Hospital Senior Billing Rep - Benson Tower/Remote

Worldwide Salaried

Master Data Coordinator

Worldwide Salaried

Consumer Underwriter - Remote in Philadelphia, MD, VA, DE Territory

Worldwide Salaried

Customer Service Rep - 100% Remote

Worldwide Salaried

Remote Inbound Customer Service Representative

Worldwide Salaried

Student Success Coach, Unbound Academy (Remote) - $60,000/year USD

Worldwide Salaried

Regional Sales Manager (Porvoo, FI)

Worldwide Salaried

Directives Coordinator

Worldwide Salaried

Entry Level Apple Remote Jobs (Data Entry) $75000/Yearly ? WFH

Worldwide Salaried

Experienced Data Entry Professional and Administrative Support Specialist – Entry-Level Opportunity for Career Growth and Development with Walmart

Worldwide Salaried

Pfizer Sağlık Temsilcisi Hastane Takımı (İstanbul)

Worldwide Salaried

Experienced Pre-Licensed Customer Service Representative for Dynamic Insurance Support – Remote Work Opportunity with Comprehensive Training and Growth Prospects

Worldwide Salaried

Virtual Account Manager

Worldwide Salaried

Campus Launcher (Students can apply)

Worldwide Salaried

Talent Acquisition Supervisor

Worldwide Salaried

Truck Driver - Home Every Weekend in Davenport, IA

Worldwide Salaried