THE PURPOSE:The Data Engineer works within the Data Solutions organization on critical reporting, visualization, and analysis initiatives.
Reporting spans from custom ad-hoc requests to scheduled jobs to supporting our growing data warehouse, and building our future cloud analytics platform.
The developer must be able to communicate to business users the exact scope of metrics as well as the confidence and quality of the data in reports.THE ROLE:Work directly with the business users to understand the reporting needs and lead business users to practical solutionsHelp translate business requirements into specification documents to track and perform analysis of new and existing site featuresUnderstand the necessity of data quality and requirement for confidence of accuracy of any reportsDevelop/monitor/maintain new reports, dashboards, visualizations, procedures, data structures and databasesDesign data pipelines and maintain data pipelines in cloud or on-premise environmentsDesign data schema, perform data transformations, enrichments, and manipulations with efficiency and reusability in mindPlanning, conducting and directing the analysis of complex business problems and projectsTHE CANDIDATE:· Understand data structures and algorithms.
Understanding of basic statistics (confidence intervals, statistical significance, etc)Experience in working with large size data sets (Billions of rows/Petabytes of data)Experience in working with various data sources (ODBC, flat files, etc)Experience working with and designing complex data schemasStrong skills in SQL, Java and/or PythonExperience with SQL query performance optimizationStrong skills Experience with Apache Big Data Frameworks (Hadoop/EMR/Databricks, Spark, Hive)Strong experience with Spark performance optimization and troubleshootingExperience with Kafka and event driven architecturesFamiliarity with workflow scheduling/orchestration tools (Airflow, Jenkins)Experience with AWSExperience with Tableau and or other Self Service Analytical tools.Implemented Redshift, Snowflake, Azure Data Warehouse, ADLS, S3, Kafka, Presto, EMR, Databricks, or Data Lake Architecture in one or more public clouds in a Production Large Scale environment.To Be Successful You Will Be:Highly motivated with a great attitude and desire to dive into raw data to understand trends in behavior to find insightsExcellent at multitasking who can execute multiple requests and reports under tight timelinesInquisitive, self-starter, able to work autonomouslyAble to work in a fast-paced dynamic startup like environmentDetail-oriented tactician who strives for perfectionStrong verbal and written communication (and listening) skillsExcellent reading comprehension and attention to detail.Strong problem-solving skillsStrong documentation skills as you code (Jira, Confluence)As a Data Engineer, your day-to-day tasks will include:Helping us leverage large-scale data stores and data infrastructure by building out data pipelines, streams, and utilities in Spark and other technologies for feedback to our business systems, partners, or usersDeveloping robust, low latency and fault tolerant pipelines to support business critical systemsAggregating key metrics for business partners to inform key decisionsWorking with cloud technologies to build and deploy your applicationsEnvironmentCan work effectively on a small and nimble team, no trouble context-switchingEducationB.S./M.S.
in Computer Science or Computer Engineering or 3+ years of equivalent experience
How to Apply
Please follow the application procedure at stackoverflow.com for more info.