Python / Data Science.

Company: Telecomm Software
Location: Raleigh, North Carolina, United States
Type: Full-time
Posted: 31.AUG.2021
< >

Summary

Data Scientist / Data Engineer Data Science Role -work on data ranging from geo spatial, social and text to quantitative factor modeling. D...

Description

Data Scientist / Data Engineer Data Science Role -work on data ranging from geo spatial, social and text to quantitative factor modeling.

Data Engineer Role for data lake / Data Brick developement

Candidate should be passionate about data and insights. The perfect candidate will have a background in a quantitative or technical field, will have experience building AI ML models and pipelines on large datasets, and will have experience in evidence based insight generation.

Exploratory Analysis
• Ability to explore a data set to evaluate the quality of the data across datasets in geo spatial, text, statistical etc.
• Build reports showcasing various reporting metrics
• Showcase various trends and insights via descriptive analytics
Model Building
• Perform feature engineering leveraging various techniques in statistics, ML, geo etc.
• Build various models and evaluate against each other to find the best model with goodness of fit
• Model validation
• Model execution pipeline with model validation, model monitoring, model scoring, model decay and retraining Data Infrastructure
• Work in Azure, Databricks and mlflow with ability to do some Hadoop and Hive
• Have a working knowledge of Spark You Offer Bachelor's degree in Computer Science or equivalent
Advanced degree in a technical or quantitative stream preferred
Knowledge of Python , Scala or Java
5+ years of professional experience and discipline in building Machine Learning models
5+ years of experience in Statistics and Data Science techniques like exploratory analysis, feature engineering and ML techniques like clustering, regressions, classifications etc.
Experience with machine learning packages such as TensorFlow, PyTorch, Keras, Scikit-Learn, NumPy, SciPy, Pandas, StatsModels, Spark ML
Exposure to Machine Learning techniques like hyper parameter tuning, model validation, model serving, model monitoring, retraining etc. (Machine Learning pipeline)
Experience with machine learning lifecycle tools (i.e. mlflow, kubeflow)

Daily Rate at $720 Corp

- provided by Dice

 
Apply Now

Share

Flash-bkgn
Loader2 Processing ...