
Data Engineer Architect III
- Hybrid
- Cincinnati, Ohio, United States
- Information Technology
Cloudera NiFi expert needed for ETL/ELT pipeline optimization, Python scripting, and data integration. Strong focus on performance tuning, distributed systems, and scalable architecture.
Job description
Job Summary
We are seeking an experienced Data Engineer/Architect with strong expertise in Cloudera NiFi, data pipeline optimization, and ETL/ELT processes. This role involves assessing, enhancing, and supporting scalable data integration solutions.
Key Responsibilities
NiFi Platform Assessment & Optimization
Evaluate existing Cloudera NiFi flows for performance, scalability, and maintainability
Identify improvements to simplify architecture and reduce operational overhead
Recommend best practices, reusable patterns, and standards
Data Integration & Pipeline Support
Develop and enhance ETL/ELT pipelines using NiFi
Integrate data from databases, APIs, filesystems, streaming platforms, and cloud storage
Troubleshoot pipeline failures, latency, and performance issues
Implement monitoring, error handling, and recovery mechanisms
Python & Custom Development
Use Python for data transformation, automation, and scripting
Build integrations between NiFi and external systems
Governance, Security & Operations
Ensure secure data movement with authentication, authorization, and encryption
Support deployment, versioning, and operational processes
Align platform changes with compliance and operational standards
Advisory & Collaboration
Act as a technical advisor to data and platform teams
Document findings, recommendations, and improvements
Provide knowledge transfer and long-term support guidance
Job requirements
Required Skills
Hands-on experience with Cloudera NiFi in production
Strong understanding of ETL/ELT and data pipeline architecture
Proficiency in Python for scripting and automation
Experience with Linux and distributed systems
Ability to analyze and optimize existing data systems
Preferred Skills
Experience with the Cloudera ecosystem (Kafka, HDFS, Hive, Iceberg)
Familiarity with CI/CD for data pipelines
Experience in monitoring and performance tuning
Exposure to cloud or hybrid data environments
Engagement Expectations
Hands-on, deliverables-focused role
Ability to work independently and collaborate with teams
Strong communication and documentation skills
