BI Architecture Coordinator
Kinesso
IT
Mumbai, Maharashtra, India
Posted on Jan 17, 2025
Support BI Data Architecture across all clients and disciplines focusing on technology, processes, and tools, ensuring that all BI team is technically equipped to perform their jobs. Main responsibilities include:
Design and execution of modern data architectures:
- Development of data ecosystems using tools like Snowflake, SQL Server, ensuring scalability, performance, and security.
- Creation of advanced ETL pipelines to centralize, harmonize, and categorize data from heterogeneous sources (Data Warehouse, marketing systems, Digital Media APIs, and others that can be integrated).
- Implementation of Data Mesh-based architectures to promote decentralization of data access.
Advanced cloud platform management:
- Optimization of cloud platforms (AWS, Azure, Snowflake) for distributed data processing, reducing costs, and increasing efficiency.
- Intensive use of Snowflake for managing large data volumes with features like data sharing, time travel, and multi-cluster warehouses.
Automation and APIs for real-time integration:
- Development and consumption of RESTful and GraphQL APIs to enable real-time integration flows between internal and external systems.
- Automation of deployments and data pipelines using tools like GitHub Actions, Terraform.
Data governance and security:
- Definition of governance policies.
- Oversight of processes to ensure data quality and regulatory compliance.
Leadership and strategic collaboration:
- Coordination of technical teams on high-impact projects, ensuring alignment with business objectives.
- Fostering a data-driven culture through training and the establishment of best practices in the use of cloud platforms and BI tools.
Support Areas:
- Snowflake: Management of Snowflake-related requests (access, new schemas, permissions, etc.).
- Remotes (Azure, AWS): Azure and AWS support for the Regional Cloud Engineering IT team, initial contact: Danilo Ho, Senior Systems Administrator, last instance: Emma Gutierrez, Senior Manager, Cloud Engineering. For incident resolution, please first submit a ticket by email to ESD.
- Alteryx: Support for activation and use of Alteryx licenses with the Regional Cloud Engineering IT team. Initial contact: Danilo Ho, Senior Systems Administrator, for application support and Alteryx server Gallery access. For support requests and new requests, please submit a ticket by email to ESD.
- GitHub: Support for adding to the repository: Oballe, Humberto, IT Developer Manager LATAM, for requesting additions or changes of permissions in the LA-MBW-BI_ENGINEERING GitHub repository. AWS Cloud Engineering IT Regional team with Villacreses, Carlos, Senior Cloud Engineer.
Tasks:
Snowflake:
- Support connections and address issues that may arise in the workflows managed by the LATAM Insights & Analytics team members (current leader: Claudia Vargas). Additionally, manage updates to application items, monitor optimal functioning of schemas and tables storing information, manage and supervise Snowflake connections.
- Design and configure database structures, schemas, roles, and access policies.
- Evaluate updates or new Snowflake capabilities, such as data sharing, unstructured data support, or the use of Snowpark.
- Administer ODBC connections for remote teams connected to SF.
- Help resolve incidents related to workloads, slow queries, or access issues.
- User management: create, modify, and revoke user access, setting appropriate roles and permissions based on the least privilege principle.
- Monitor failures caused by misconfiguration when creating new tables in the GR_KINESSO database, which consists of several schemas created and used depending on the client.
- Technical advice: Guide team members in optimizing SQL queries, using advanced features like time travel and cloning, and following Snowflake best practices.
- Ensure data integrity (information security).
- Data modeling: Assist in creating efficient data models for reports and analysis.
- Automation: Configure and optimize data pipelines using tools like Snowpipe or integrations with ETL processes.
- Query optimization: Identify inefficient queries and collaborate to improve them, reducing costs and improving performance.
- Supervise permissions granted to different schemas, primarily: (MBLA_AGENCY_LA, MBLA_NESTLE_LA).
- Ensure that the architecture is designed to handle growth in data volume and users.
Remotes (Azure, AWS):
- Support the operation of these teams.
- Ensure that each remote has the minimum operability tools required for the assigned user.
- Manage connections to different data sources.
- Manage applications used for user task fulfillment.
Alteryx:
- Alteryx licenses: Ensure the allocation of Alteryx licenses to team users. If a new license is required, manage it according to the needs.
- Keep versions updated to ensure compatibility with the Alteryx server (Gallery).
- User management: Configure roles and permissions for users, ensuring they have appropriate access to workflows and data.
- Monitor the Alteryx server (Gallery): Oversee server performance, disk space, and resources to avoid bottlenecks.
- Updates and patches: Coordinate the installation of new versions or updates to leverage improvements and new features.
- Troubleshoot workflow errors: Assist in diagnosing and resolving issues with workflows that don't work properly.
- Workflow optimization: Identify and suggest improvements to optimize workflow efficiency and execution time.
- Technical assistance: Provide support to users to resolve issues related to specific tools or configurations.
- Scheduling: Help schedule workflows to run automatically on Alteryx Server.
- Validate results: Ensure processed data meets expected standards and is error-free.
- Integration with other tools: Facilitate Alteryx connection with external systems (databases, cloud platforms, APIs, etc.).
- Review workflows: Identify redundant workflows or those that could benefit from best practices.
- Auditing: Perform regular reviews to ensure compliance with corporate standards, security, and quality.
- Innovation: Propose new ways to leverage Alteryx to solve problems or automate processes.
Github:
- Implement rules for key branches, such as requiring code reviews before merge or triggering automatic tests.
- Define conventions for branch names, commits, and pull request (PR) messages.
- Configure GitHub Actions or similar integrations to run tests, verify linting, or deploy data pipelines automatically.
- Manage workflows for deploying solutions in development, test, and production environments.
- Monitor workflow executions to detect recurring errors and resolve them quickly.
- Add and remove collaborators based on project needs.
- Delete obsolete branches and ensure the repository doesn't accumulate unnecessary or inefficient code.
- Ensure tokens, API keys, and sensitive credentials are managed outside the repository (e.g., using GitHub Secrets).
- Review repository activity logs for unauthorized access or insecure practices.
- Assist developers in early tasks to resolve doubts about the environment or tools.
- Help resolve complex merge conflicts and provide strategies to avoid them in the future.
- Provide preconfigured scripts or tools to help developers with repetitive tasks, such as setting up local environments or validating code.
- Be available to the team via meetings, chats, or tickets, ensuring that questions are resolved quickly.
- Help developers correctly configure their local environments so they can quickly start working with the repository code.
- Validate and adjust their roles to ensure necessary access without compromising repository security.
- Investigate the error, adjust the workflow configuration, and offer recommendations to avoid similar problems.
Development Team Support ( API Developers and engineers)
Direct Support
- Coordination: Ensure that assigned tasks are completed within the established deadlines.
- Technical Support: Provide direct assistance in resolving technical issues, Python code errors, or workflow failures.
- Facilitation: Ensure they have the necessary tools and an appropriate work environment.
- Supervision: Monitor task progress and ensure operational continuity in case of absences.
- Communication: Communication will be direct and via Microsoft Teams:
- Group: A team channel to discuss general progress, notify blockers, or share project-related information.
- Individual: To address specific matters or resolve technical issues.
- Team Responsibilities The team must:
- Complete Tasks: Execute and supervise data ingestion and processing pipelines.
- Report Progress: Share a brief daily report in Teams for visibility and tracking.
- Document Knowledge: Keep key information about tasks and processes updated in the repository or assigned tools.
- Escalate Issues: Immediately report any bugs, blockers, or critical errors to receive support.
Availability of Tools
- Tool Management:
- Ensure they have: Remote or virtual machines, if needed.
- Updated Alteryx or required software licenses.
- Access to GitHub repository and other platforms.
- Local Configurations: Assist in setting up local development environments (Python, libraries, connectors).
- Daily Reporting Procedure To maintain visibility of progress, developers should: • Send a daily message in Teams (group or direct, as applicable) including: Completed Tasks: A brief summary of completed work.
- Pending Tasks: Activities in progress and missing steps.
- Blockers: Problems requiring immediate attention or support.
- Supervision and Monitoring Daily Monitoring: • Review sent reports and updates in GitHub repository issues.
- Tool Audits: • Periodically check that assigned tools are working and available for the team