Senior Data Engineer
Alembic
About Alembic
Alembic is a fast-growing Series A software startup focused on building cutting-edge solutions that transform how businesses harness and leverage data. We are a team of innovators, engineers, and product leaders passionate about solving complex problems with scalable, data-driven technology. At Alembic, we believe that great software is built by great people, and we are looking for a Data Engineer who thrives in a fast-paced, high-impact environment.
About the Role
As a Data Engineer at Alembic, you will be at the core of our data platform, building scalable and reliable data pipelines, optimizing storage solutions, and enabling real-time and batch analytics. You will work closely with data scientists, software engineers, and product leaders to design and implement robust data architectures.
Key Responsibilities
Design, develop, and maintain scalable ETL pipelines that ingest, process, and transform large volumes of structured and unstructured data.
Optimize data storage solutions using modern data lakehouse architectures and best practices for cost, performance, and reliability.
Collaborate with data scientists and engineers to integrate machine learning models and analytical workloads into production environments.
Ensure data integrity, quality, and security by implementing monitoring, alerting, and governance best practices.
Work with cloud-based data warehouses and distributed data processing frameworks.
Continuously evaluate and implement new technologies to improve data infrastructure and operational efficiency.
What We’re Looking For
10+ years of experience in data engineering, software engineering, or a related field.
Strong expertise in SQL and Python for data processing.
Experience with modern data warehousing and lakehouse solutions (i.e. Iceberg or similar).
Proficiency in working with distributed systems and big data technologies (Apache Spark, Hadoop, Kafka, Flink).
Hands-on experience with cloud platforms (AWS, GCP, Azure) and related data services.
Deep understanding of data modeling, database design, and performance optimization.
Familiarity with CI/CD pipelines, containerization (Docker, Kubernetes), and infrastructure-as-code (Terraform, CloudFormation) for data pipelines.
Strong problem-solving skills, with a passion for building reliable, scalable, and maintainable data systems.
Excellent communication skills and the ability to collaborate in a cross-functional team.
Nice to Have
Experience with Graph Databases, NoSQL, or Time-Series Databases.
Familiarity with data privacy, governance, and compliance (GDPR, HIPAA, SOC 2).
Experience with machine learning pipelines and MLOps.
Why you might be excited about Alembic:
You want to build something that is both technologically challenging and solves a real customer need. You want a role with major upside that tackles a massive market opportunity.
You are a serial startup builder or want to learn more before becoming a founder yourself. Our team holds deep experience building and selling B2B marketing solutions that work.
You want to work where you can take a big swing at building something big while maximizing your personal growth.
Why you might not be excited:
If you only want to tell people
You prefer company practices with 100% built out process for every little detail.
You prefer static over dynamic. Projects, priorities, and roles will adapt to your skill set and your goals. Though we have a playbook for growth, we proudly remain an early stage startup.