Job Summary
An information and analytics company has an open position for a Remote Data Pipeline Senior Software Engineer in Raleigh.
Core Responsibilities of this position include:
- Solving the scalability issues of data ingestion pipelines for the search backend of company's product, dramatically improving both velocity and consistency of ETLs from data lake to Solr
- Bringing their own perspective on how to solve a variety of internal and external opportunities
- Completing complex bug fixes
Must meet the following requirements for consideration:
- Minimum 2 years of developing and maintaining ETL pipelines in Spark / Hadoop / Kafka
- Minimum 2 years of experience in Java or Scala
- Minimum 2 years of scaling search server clusters to accommodate increasing traffic to meet specific performance requirements
- Experience parsing data from XML documents
- Experience in data modeling, design and manipulation, optimization, and best practices
- Minimum 5+ years of Software Engineering experience BS Engineering/Computer Science or equivalent experience