Company name: Blackstraw.ai
Job Type: Full-time
Experience: 10+ years
Role overview:
We are looking for an experienced big data architect to design and build scalable data platforms that support enterprise AI and analytics solutions. In this role, you will work closely with engineering teams and business stakeholders to define data architecture, design reliable data pipelines, and implement modern cloud-based data platforms.
This is a hands-on architecture role focused on solving complex data challenges and building systems that can scale across enterprise environments.
Key Responsibilities:
- Design and lead the implementation of scalable data architectures that support analytics, machine learning, and AI workloads.
- Architect and develop batch and near real-time data pipelines using modern big data frameworks and cloud platforms.
- Build and optimize data transformation workflows using technologies such as Spark, SQL, and PySpark.
- Design and implement Data Lake and Data Warehouse solutions on modern cloud data platforms, working across client environments that may include AWS, Azure, or GCP.
- Define and maintain data modeling standards using established approaches such as Kimball dimensional modeling or Data Vault.
- Architect reliable ETL/ELT pipelines that ensure data quality, performance, and scalability.
- Review existing customer data architectures and recommend improvements aligned with business and technical requirements.
- Guide engineering teams on best practices for distributed data processing, system scalability, and performance optimization.
- Participate in architecture reviews, design discussions, and technical decision-making across projects.
- Enable data consumption for analytics and reporting using modern BI and visualization platforms.
- Work within agile delivery environments to deliver scalable and reliable data solutions.
Preferred Qualifications:
- 10+ years of experience in data engineering, big data systems, or distributed data platforms.
- Proven experience designing enterprise-scale data platforms supporting analytics and AI workloads.
- Strong hands-on experience with distributed data processing frameworks such as Apache Spark.
- Experience implementing modern data warehouse or lakehouse architectures using platforms such as Snowflake, Redshift, or similar technologies.
- Experience working with cloud-based data ecosystems across enterprise client environments.
- Familiarity with programming languages such as Java or Python is an advantage.
Key Traits:
- Strong problem-solving mindset and ability to work on complex data challenges.
- Excellent communication skills with the ability to explain technical concepts clearly.
- Comfortable collaborating with cross-functional and distributed teams.
- Self-driven with the ability to own architectural decisions and guide engineering teams.
Company Profile:
Blackstraw.ai is an end-to-end technology services company specializing in Artificial Intelligence (AI) and Engineering solutions across Data Science, Data Engineering, LLM/GenAI and LLMOps.
Founded in 2018, we help global enterprises across North America, Europe and Asia build and operationalize AI systems that create measurable business impact. Our mission is to make AI adoption simpler, faster and more scalable through a blend of deep domain expertise, reusable accelerators and proven engineering practices.
With a team of 450+ engineers, data scientists and AI specialists, we partner with organizations to deliver real-world outcomes in areas such as predictive analytics, computer vision, natural language processing and Generative AI.
At Blackstraw.ai, we’re passionate about solving complex business challenges through intelligent automation, modern data architectures and next-gen AI applications – enabling enterprises to move from AI experimentation to transformation.
Headquartered in Florida (USA) with operations in Canada and India, Blackstraw.ai continues to empower global enterprises to unlock the true potential of AI.