GCP Data Engineer
Capgemini
Data Science
Mexico
Posted on Dec 18, 2024
HR: Raul García
Position: GCP Data Engineer
Location: Aguascalientes, Mexico
Industry - Sector: MALS
What you’ll do:
- Build, maintain, and optimize end-to-end ETL pipelines using Google Cloud Dataflow, Apache Beam, Cloud Composer, and Cloud Functions.
- Automate data ingestion, transformation, and integration processes from multiple sources (structured and unstructured data) into cloud-based data platforms.
- Design and implement scalable cloud-based data architectures using services like Google BigQuery, Google Cloud Storage (GCS), Cloud Pub/Sub, and Cloud Dataproc.
- Architect data solutions that are reliable, performant, and cost-effective for large-scale data processing and storage needs.
- Build and manage data lakes using Google Cloud Storage and ensure data is properly ingested, stored, and processed in an organized and secure manner.
- Implement Google BigQuery data warehouses, optimize data models for performance, and ensure they are scalable to handle large datasets.
- Leverage Google Cloud Dataflow or Apache Beam to transform data into formats suitable for reporting and machine learning models.
- Integrate diverse data sources such as relational databases, NoSQL databases, APIs, and flat files into the data pipeline architecture.
- Automate the deployment of data pipelines, processes, and infrastructure using Google Cloud Deployment Manager, Terraform, or Cloud Composer.
- Design and manage CI/CD pipelines for efficient deployment of data engineering solutions.
- Implement best practices for data governance, ensuring data privacy and compliance with security standards (e.g., encryption, data masking, and access control).
- Manage permissions and security with Google IAM, and ensure that sensitive data is securely handled and stored.
- Continuously monitor the performance of data pipelines and storage solutions, identifying opportunities for optimization and cost reduction.
- Troubleshoot and resolve data integration and pipeline issues to ensure smooth and efficient data processing.
- Collaborate with cross-functional teams (data scientists, analysts, business stakeholders) to understand data requirements and deliver high-quality data solutions.
- Provide support for data-related questions and troubleshooting, and assist with data visualization and reporting tasks.
- Document data engineering processes, including pipeline architectures, configurations, and best practices.
- Prepare and deliver regular updates on data engineering projects and their progress.
What you’ll bring:
- Expertise in Google Cloud BigQuery, Google Cloud Storage (GCS), Cloud Dataflow, Cloud Pub/Sub, Google Cloud Dataproc, Cloud Composer, Cloud Functions, and Cloud Bigtable.
- Experience in building ETL pipelines using Cloud Dataflow, Apache Beam, Google Cloud
- Ability to integrate structured and unstructured data sources into cloud-based systems.
- Strong experience working with Google BigQuery for building and managing data warehouses and running optimized analytical queries.
- Expertise in data ingestion, transformation, and integration using GCP tools, including Google Cloud Pub/Sub, Cloud Dataflow, and BigQuery.
- Proficiency in Python, SQL, and JavaScript for data engineering tasks, scripting, and automation.
- Experience with Apache Spark and PySpark is a plus for handling large-scale data processing.
- Experience with Google Cloud Deployment Manager, Terraform, or Cloud Composer to automate infrastructure and data pipelines.
- Strong understanding of Google IAM, data encryption, and secure data access practices.
- Proficiency in Git for version control and collaborative development.
Soft skills:
Work under pressure, quality at work, results-oriented
What can YOU expect in a career with Capgemini?
- Working in a team environment, Consultants will focus on the analysis, design and development of technology-based solutions for Capgemini’s clients.
- You will work alongside technical, functional and industry specialists to assist with the development, implementation and integration of innovative system solutions including methods, techniques and tools.
- You will contribute to client satisfaction by providing timely and responsive value-added services and work products.
- Capgemini offers a competitive compensation and benefits package.
- Headquartered in Paris, France, Capgemini is a deeply multicultural organization of more than 340 thousand professionals, with a presence in Mexico distributed among 3 sites located in Mexico City, Monterrey and Aguascalientes.
- Capgemini has developed its own way of working, the Collaborative Business Experience™, and draws on Rightshore, its worldwide delivery model.
You will love this job because:
- Capgemini focuses on giving each new hire a YOU-nique experience through our recruitment process and on-boarding program, as well as by helping you to build your own career and professional skills foundation.
- Capgemini provides a collaborative environment that embodies and holds the following stated values close to heart: Honesty, Boldness, Trust, Freedom, Team Spirit, Modesty, and Fun.
- Capgemini cultivates an atmosphere for development that enables YOU to be hands-on, planning for your growth, both horizontally and vertically.
“At Capgemini Mexico, we aim to attract the best talent and are committed to creating a diverse and inclusive work environment, so there is no discrimination based on race, sex, sexual orientation, gender identity or expression, or any other characteristic of a person. All applications are welcome and will be considered on merit against the job requirements and/or experience for the position.”