Senior Data Engineer

Alembic

Data Science
San Francisco, CA, USA
Posted on Mar 12, 2025

About Alembic

Alembic is a fast-growing Series A software startup building solutions that transform how businesses harness data. We are a team of innovators, engineers, and product leaders passionate about solving complex problems with scalable, data-driven technology. At Alembic, we believe that great software is built by great people, and we are looking for a Senior Data Engineer who thrives in a fast-paced, high-impact environment.

About the Role

As a Senior Data Engineer at Alembic, you will be at the core of our data platform, building scalable and reliable data pipelines, optimizing storage solutions, and enabling real-time and batch analytics. You will work closely with data scientists, software engineers, and product leaders to design and implement robust data architectures.

Key Responsibilities

  • Design, develop, and maintain scalable ETL pipelines that ingest, process, and transform large volumes of structured and unstructured data.

  • Optimize data storage solutions using modern data lakehouse architectures and best practices for cost, performance, and reliability.

  • Collaborate with data scientists and engineers to integrate machine learning models and analytical workloads into production environments.

  • Ensure data integrity, quality, and security by implementing monitoring, alerting, and governance best practices.

  • Work with cloud-based data warehouses and distributed data processing frameworks.

  • Continuously evaluate and implement new technologies to improve data infrastructure and operational efficiency.

What We’re Looking For

  • 10+ years of experience in data engineering, software engineering, or a related field.

  • Strong expertise in SQL and Python for data processing.

  • Experience with modern data warehousing and lakehouse solutions (e.g., Iceberg or similar).

  • Proficiency in working with distributed systems and big data technologies (Apache Spark, Hadoop, Kafka, Flink).

  • Hands-on experience with cloud platforms (AWS, GCP, Azure) and related data services.

  • Deep understanding of data modeling, database design, and performance optimization.

  • Familiarity with CI/CD pipelines, containerization (Docker, Kubernetes), and infrastructure-as-code (Terraform, CloudFormation) for data pipelines.

  • Strong problem-solving skills, with a passion for building reliable, scalable, and maintainable data systems.

  • Excellent communication skills and the ability to collaborate in a cross-functional team.

Nice to Have

  • Experience with Graph Databases, NoSQL, or Time-Series Databases.

  • Familiarity with data privacy, governance, and compliance (GDPR, HIPAA, SOC 2).

  • Experience with machine learning pipelines and MLOps.

Why you might be excited about Alembic:

  • You want to build something that is both technologically challenging and solves a real customer need. You want a role with major upside that tackles a massive market opportunity.

  • You are a serial startup builder or want to learn more before becoming a founder yourself. Our team holds deep experience building and selling B2B marketing solutions that work.

  • You want to work where you can take a big swing at building something big while maximizing your personal growth.

Why you might not be excited:

  • If you only want to tell people

  • You prefer a company with a fully built-out process for every little detail.

  • You prefer static over dynamic. Projects, priorities, and roles will adapt to your skill set and your goals. Though we have a playbook for growth, we proudly remain an early-stage startup.