Back to positions

Data Engineer

Remote role Full-time Open position

Innodata (Nasdaq: INOD) is a global data engineering company. We believe that data and Artificial Intelligence (AI) are inextricably linked. Our mission is to enable the responsible advancement of artificial intelligence by providing the data, evaluation frameworks, and human expertise required to build AI systems that can be trusted at scale. We provide a range of transferable solutions, platforms, and services for Generative AI / AI builders and adopters. In every relationship, we honor our 36+ year legacy delivering the highest quality data and outstanding outcomes for our customers. Scope of the Role: We are seeking a Data Engineer to design and build enterprise data warehouses, data lakes, and pipelines that power data-driven decision-making for data center supply chain and real estate operations. This role is responsible for creating scalable, secure, and optimized ETL infrastructure on GCP/AWS, while enabling advanced AI/ML use cases such as RAG, copilots, and agentic AI for predictive analytics and workflow automation. What You’ll Own: Design and implement data-driven solutions on GCP including BigQuery, Cloud Storage, Dataflow, Pub/Sub, and Looker/BI. Build ETL scripts using SQL and Python to extract, clean, and transform structured and unstructured data from ERP, procurement, logistics, and facility management systems. Develop and optimize data pipelines for ingestion, transformation, and loading into enterprise data lakes and warehouses. Build and extend end-to-end data and BI solutions, spanning extraction, storage, transformation, and visualization layers. Partner with supply chain, real estate, and AI/ML teams to provide pipelines for AI solutions (e.g., RAG ingestion, Copilot integration, multi-agent workflows). Ensure data governance, lineage, and compliance across supply chain datasets. Continuously optimize query performance, ETL processes, and pipeline reliability. You’ll Thrive in This Role If You Have: Advanced proficiency in SQL (complex queries, optimization) and Python (data engineering, scripting, APIs). Experience building ETL/ELT pipelines operating on structured and unstructured data sources. Knowledge of enterprise data warehouse and data lake architectures. Exposure to data pipelines for AI/ML (vector DB ingestion, embeddings, RAG pipelines, copilots, agents). Familiarity with supply chain or data center operations data is a strong plus. Bonus: experience with ML Engineering, data visualization tools (Looker, Tableau, Power BI) and MLOps practices. Strong hands-on expertise with GCP services: BigQuery, Dataflow, Pub/Sub, Cloud Storage, Looker/BI (or similar, preferred). Please be aware of recruitment scams involving individuals or organizations falsely claiming to represent employers. Innodata will never ask for payment, banking details, or sensitive personal information during the application process. To learn more on how to recognize job scams, please visit the Federal Trade Commission’s guide at https://consumer.ftc.gov/articles/job-scams. If you believe you’ve been targeted by a recruitment scam, please report it to Innodata at [email protected] and consider reporting it to the FTC at ReportFraud.ftc.gov. Apply To This Job

Further positions

Data Engineer

Remote role Full-time

Senior Programmer-SAS&R-Remote

Remote role Full-time

Inside Sales Representative

Remote role Full-time

Platform Engineer (Moodle Administrator)

Remote role Full-time

Applied Research Scientist, LLM Evaluation & Post-Training

Remote role Full-time

Financial Service Representative

Remote role Full-time

Weekend Scheduler (Logistics Operations)

Remote role Full-time

Expert Full-Stack Engineer (Get Discovered)

Remote role Full-time

Director of Sales - Lifecycle Services

Remote role Full-time

Podcast Content reviewer - Swedish (Sweden)

Remote role Full-time

Customer Care Associate – Part‑Time Remote Role | $17‑$18/hr + $50/mo Stipend + $500 Sign‑On Bonus | Join arenaflex’s Dynamic Customer Experience Team

Remote role Full-time

Experienced Bilingual Customer Service Representative (Spanish / English) - Remote Work Opportunity with arenaflex

Remote role Full-time

Manager, US Medical Affairs, HIV Treatment Strategy

Remote role Full-time

Experienced Live Chat Customer Service Representative – Remote Customer Support Team at arenaflex

Remote role Full-time

Detail-Oriented Data Entry Clerk with Competitive Pay & Flexible Schedule

Remote role Full-time

Part-Time Virtual Assistant & Data Entry Specialist – E-Commerce Operations (Remote Work from Home)

Remote role Full-time

Speech Language Pathologist

Remote role Full-time

Outbound Business Development Representative - AMER

Remote role Full-time

Executive Director, Online Investing Growth and Business Strategy - Remote Opportunity

Remote role Full-time

Medicaid LTC Financial Eligibility Case Reviewer - Remote

Remote role Full-time