Back to positions

Sr. Data Engineer AWS Snowflake

Remote role Full-time Open position

About Fusemachines Fusemachines is a 10+ year old AI company, dedicated to delivering state-of-the-art AI products and solutions to a diverse range of industries. Founded by Sameer Maskey, Ph.D., an Adjunct Associate Professor at Columbia University, our company is on a steadfast mission to democratize AI and harness the power of global AI talent from underserved communities. With a robust presence in four countries and a dedicated team of over 400 full-time employees, we are committed to fostering AI transformation journeys for businesses worldwide. At Fusemachines, we not only bridge the gap between AI advancement and its global impact but also strive to deliver the most advanced technology solutions to the world. About the role: This is a remote, full time consulting position (contract) responsible for designing, building, and maintaining the infrastructure required for data integration, storage, processing, and analytics (BI, visualization and Advanced Analytics) to optimize digital channels and technology innovations with the end goal of creating competitive advantages for food services industry around the globe. We’re looking for a solid lead engineer who brings fresh ideas from past experiences and is eager to tackle new challenges.

We’re in search of a candidate who is knowledgeable about and loves working with modern data integration frameworks, big data and cloud technologies. Candidates must also be proficient with data programming languages (Python and SQL), AWS cloud and Snowflake Data Platform. The data engineer will build a variety of data pipelines and models to support advanced AI/ML analytics projects, with the intent of elevating the customer experience and driving revenue and profit growth globally.

Qualification & Experience:

  • Must have a full-time Bachelor's degree in Computer Science or similar from an accredited institution.
  • At least 3 years of experience as a data engineer with strong expertise in Python, Snowflake, PySpark, and AWS.
  • Proven experience delivering large-scale projects and products for Data and Analytics, as a data engineer.

Skill Set Requirement:

  • Vast background in all things data-related.
  • 3+ years of real-world data engineering development experience in Snowflake and AWS (certifications preferred).
  • Highly skilled in one or more programming languages, must have Python, and proficient in writing efficient and optimized code for data integration, storage, processing, manipulation and automation.
  • Strong experience in working with ELT and ETL tools and being able to develop custom integration solutions as needed, from different sources such as APIs, databases, flat files, and event streaming. Including experience with modern ETL tools such as Informatica, Matillion, or DBT; Informatica CDI is a plus.
  • Strong experience with scalable and distributed Data Technologies such as Spark/PySpark, DBT and Kafka, to be able to handle large volumes of data.
  • Strong programming skills in SQL, with proficiency in writing efficient and optimized code for data integration, storage, processing, and manipulation.
  • Strong experience in designing and implementing Data Warehousing solutions in AWS with Snowflake.
  • Good understanding of Data Modelling and Database Design Principles. Being able to design and implement efficient database schemas that meet the requirements of the data architecture to support data solutions.
  • Proven experience as a Snowflake Developer, with a strong understanding of Snowflake architecture and concepts.
  • Proficient in Snowflake services such as Snowpipe, stages, stored procedures, views, materialized views, tasks and streams.
  • Robust understanding of data partitioning and other optimization techniques in Snowflake.
  • Knowledge of data security measures in Snowflake, including role-based access control (RBAC) and data encryption.
  • Experience with Kafka, Pulsar, or other streaming technologies.
  • Experience orchestrating complex task flows across a variety of technologies, Apache Airflow preferred.
  • Expert in Cloud Computing in AWS, including deep knowledge of a variety of AWS services like Lambda, Kinesis, S3, Lake Formation, EC2, ECS/ECR, IAM, CloudWatch, EKS, API Gateway, etc
  • Good understanding of Data Quality and Governance, including implementation of data quality checks and monitoring processes to ensure that data is accurate, complete, and consistent.
  • Good Problem-Solving skills: being able to troubleshoot data processing pipelines and identify performance bottlenecks and other issues.

Responsibilities:

  • Follow established design and constructed data architectures. Developing and maintaining data pipelines (streaming and batch), ensuring data flows smoothly from source (point-of-sale, back of house, operational platforms and more of a Global Data Hub) to destination. Handle ETL/ELT processes, including data extraction, loading, transformation and loading data from various sources into Snowflake to enable best-in-class technology solutions.
  • Play a key role in the Data Operations team - developing data solutions responsible for driving Growth.
  • Contribute to standardizing and developing a framework to extend these pipelines globally, across markets and business areas.
  • Develop on a data platform by building applications using a mix of open-source frameworks (PySpark, Kubernetes, Airflow, etc.) and best-in-breed SaaS tools (Informatica Cloud, Snowflake, Domo, etc.).
  • Implement and manage production support processes around data lifecycle, data quality, coding utilities, storage, reporting and other data integration points.
  • Ensure the reliability, scalability, and efficiency of data systems are maintained at all times.
  • Assist in the configuration and management of Snowflake data warehousing and data lake solutions, working under the guidance of senior team members.
  • Work with cross-functional teams, including Product, Engineering, Data Science, and Analytics teams to understand and fulfill data requirements.
  • Contribute to data quality assurance through validation checks and support data governance initiatives, including cataloging and lineage tracking.
  • Takes ownership of storage layer, SQL database management tasks, including schema design, indexing, and performance tuning.
  • Continuously evaluate and integrate new technologies to enhance data engineering capabilities and actively participate in our Agile team meetings and improvement activities.
Fusemachines is an Equal opportunity employer, committed to diversity and inclusion. All qualified applicants will receive consideration for employment without regard to race, colour, religion, sex, sexual orientation, gender identity, national origin, disability, or any other characteristic protected by applicable federal, state, or local laws.

Originally posted on Himalayas

Apply To this Job

Further positions

Banking & Regulatory Reporting Governance, Manager

Remote role Full-time

Account Director

Remote role Full-time

Head of Business Development & Operations

Remote role Full-time

Seasonal Customer Service Representatives

Remote role Full-time

Backend Developer (Ruby on Rails)

Remote role Full-time

Planungs- und Prozessingenieur (m/w/d) [EUROPA]

Remote role Full-time

Full-Time SSA - PH Signal Hill - (5133)

Remote role Full-time

Project Manager - (ZR_24905_JOB)

Remote role Full-time

Senior Software Engineer - Mobile Android

Remote role Full-time

Cyber Security Architecture

Remote role Full-time

Store Interior Design Manager – Indeed Jobs US

Remote role Full-time

Experienced Part-Time Data Entry Typist – Remote Online Work Opportunity with Flexible Scheduling

Remote role Full-time

Business Development Representative

Remote role Full-time

Experienced Virtual Customer Care Professional – Remote Work Opportunity with arenaflex for Delivering Exceptional Service and Driving Customer Satisfaction

Remote role Full-time

Finance Business Partner, MENA , MENA-TR Finance

Remote role Full-time

Intermediate Accountant, Financial Reporting & Insights

Remote role Full-time

Fully Remote Amazon Customer Service - US (Work From Home)

Remote role Full-time

Experienced Data Entry Operator – Part-time Opportunity for Remote Work in Ohio

Remote role Full-time

Experienced Customer Service Representative – Remote Opportunity at arenaflex

Remote role Full-time

Experienced Customer Service Representative – Remote Opportunity for Delivering Exceptional Client Experiences and Driving Business Growth through Empathetic Support and Effective Problem-Solving

Remote role Full-time