[Remote] Senior Data/ML Engineer (AWS)

Worldwide Salaried Open

Note: The job is a remote job and is open to candidates in USA. Capnexus is a comprehensive services provider specializing in retail software development and support. They are seeking a highly skilled Senior AWS Data/ML Engineer to lead data architecture, pipeline development, and data integrations, leveraging advanced cloud data engineering skills and generative AI to modernize enterprise workflows.

Responsibilities

Participate in data discovery workshops to inventory source systems including property management platforms, marketing channels, and CRM data, and translate findings into data lake architecture requirements
Design and implement a multi-zone enterprise data lake on Amazon S3 (raw, conformed, enriched, aggregated) with ingest, cleansing, and business layers aligned to the SOW architecture
Build batch and streaming data ingestion pipelines using AWS Glue, Amazon Kinesis, and AWS Data Pipeline across CDP, marketing, and property management data sources
Implement data transformation and orchestration frameworks using AWS Glue ETL and AWS Step Functions, including AWS Glue Data Catalog for metadata management and discovery
Configure Amazon Athena for serverless SQL querying across the data lake; support QuickSight integration with curated data sets for business analytics
Develop and deploy ML models on Amazon SageMaker for lead scoring, predictive maintenance, intelligent underwriting risk scoring, and AI-powered audience segmentation
Integrate Amazon Bedrock foundation models to enable generative AI capabilities including customer profile enrichment, hyper-personalization, and intelligent marketing automation
Use Kiro CLI to accelerate AI-assisted development workflows, spec-driven pipeline implementation, and automated code generation tasks
Design and implement entity resolution pipelines using Amazon Entity Resolution to identify, deduplicate, and merge customer records into unified golden records
Implement real-time and batch data synchronization pipelines between source systems and the Customer Data Platform (CDP)
Support Azure data lake migration: conduct discovery, assess schemas and transformation logic, provision AWS target environments, execute migration via AWS DataSync, and perform data validation and reconciliation
Implement data lake security using AWS Lake Formation, including row-level security and column-level encryption
Build and maintain data models to support Customer 360 views, ML feature stores, and executive analytics dashboards
Ensure data quality, validation, and integrity across all pipeline stages and ML model outputs; support UAT for data-dependent features
Collaborate with Full Stack, DevOps/MLOps, and AWS engagement teams; contribute to architecture documentation, pipeline runbooks, and data governance documentation

Skills

5+ years of data engineering or ML engineering experience, with at least 2+ years in AWS cloud environments
Strong proficiency in Python and SQL; experience with AWS data services including S3, Glue, Athena, Kinesis, and Step Functions
Hands-on experience with Amazon SageMaker for model development, training, tuning, and endpoint deployment
Working knowledge of Amazon Bedrock for integrating and applying foundation models in production-grade pipelines
Experience designing and implementing multi-zone data lake architectures on Amazon S3, including lifecycle policies and Lake Formation governance
Familiarity with Kiro CLI or comparable AI-assisted/agentic development tooling
Experience with entity resolution, deduplication, or master data management concepts and tools
Solid understanding of data modeling, feature engineering, data quality practices, and ML integration testing
Experience with AWS Lambda and AWS Step Functions for serverless workflow orchestration
Familiarity with Amazon API Gateway for exposing data services and model endpoints
Strong analytical, problem-solving, and communication skills; comfortable working in Agile/Scrum teams alongside AWS Professional Services
Experience with Azure Data Lake, Azure Data Factory, or Azure Synapse — particularly in cloud-to-cloud migration contexts
Familiarity with Amazon Entity Resolution for customer identity and deduplication use cases
Experience with MLOps practices including model monitoring, drift detection, and automated retraining on SageMaker
Experience with LLM prompt engineering, RAG architectures, or fine-tuning workflows on Amazon Bedrock
Knowledge of Amazon QuickSight for analytics dataset preparation and embedded dashboard development
AWS Certification (Machine Learning Specialty, Data Analytics Specialty, or Solutions Architect)
Background in real estate, property management, marketing technology, or insurance industries

Benefits

Remote work

Company Overview

CapNexus is a consulting and IT services company that provides enterprise technology transformation services for companies. It was founded in 2026, and is headquartered in Middletown, New York, USA, with a workforce of 51-200 employees. Its website is https://capnexus.io.

Apply To This Job

Apply now

[Remote] Senior Data/ML Engineer (AWS)

More jobs

[Remote] Senior SAP MM Purchasing consultant

[Remote] Analytics Account Manager

[Remote] Data Engineer

[Remote] Python Developer withMCP (Model Context Protocol)

[Remote] Account Manager Oil & Gas

[Remote] Associate Director, Sales - Financial Services (PE Focus)

[Remote] AI Engineer

[Remote] Forward Deployment Engineer (Architect Level) — Google Cloud Platform

[Remote] Data Analyst / Programmer

[Remote] Head Of Business Development

Experienced Junior Data Entry Clerk – Remote Part-Time Position at arenaflex

Senior Manager, Medical Loss Ratio

[Remote] Tax Analyst, State & Local Tax (SALT) Indirect

Senior Specialist, Talent Solution Delivery, Enterprise

Experienced Customer Care Associate II - National General

Licensed Psychologist- Remote

Executive Assistant and HR Coordinator

Guest Experience Consultant

Technical Support Engineer (French-speaking)

[Remote] Customer Support Executive (Remote)