Job Description
High level activity description: Regexes will be used in Spark Structured Streaming processes that read data from Kafka. (Approx. 1,000,000 records / 1s.). After processing (enrichment, categorization, filtration), they are loaded into Elasticsearch.
- Create and iterate on efficient regular expressions through identification of patterns and high priority items
- Manage existing regular expression pool, updating and modifying as necessary
- Search for optimizing the business processes with the power of data
- Analyze data and identify possible use and application cases of data analytics in the business
- You will have the opportunity to work with variety of data mining and data analytics.
Ideal Candidate
- University degree in the field of information technology
- You are excited of Big Data and Data Analytics, or you already have first working experience in that
- Experience with data operations, including implementing regular expression (RegEx) and variables
- Happy / interested to learn Spark, Scala, Hadoop, Ariflow, Kafka, ElasticSearch (or experience in this area would be a plus)
- You are a Team Player, are open for international environment and be able to travel occasionally within Europe
- You are an analytical thinker, have an affinity for structure, grasp things fast and work systematically.
- 0 – 2 years development experience
- Fluent written and spoken English, German skills beneficial
- Creativity, dedication and “can do” attitude