Data Scientist

Summary

Predicted customer confidence score ranged from 0 to 1. 1 being the highest probability of a lead being converted to a customer to target those customers for follow up.
Imported market data collected from website and various ad agencies and stored in sequel server database into python.
Cleaned data and handled missing data with random forest algorithm and other techniques.
Developed features(Feature Engineering) from the processed data by creating various user defined functions. Created calculated feature columns and train model with mini batching.
Used over sampling technique to up sample low occurrence target values to be learned more by the model.
Developed train and test data by using stratifying technique to split data into same proportion of high and low occurrence target values. Trained random forest, neural network and logistic regression models with train data and calculated performance between each model on test data.
Used k-fold cross validation on train data to minimize variance and predict better results on test data.
Explored grid search cross validation to tune hyperparameters to find best parameters. Tested various performance metrics like accuracy,F1-score,Precision and Recall and used Recall to calculate the performance.

Expectations

Building pipelines to analyze data, feature engineering, building predictive models etc

Employment Preferences
Expected Base Salary

**0,000 USD

Expected Hourly Rate

** USD

Academic Degree
Experience

Total Professional Experience

5 years

Startup Experience

no experience

Big-Tech Companies

no experience

Enterprise Experience

5 years
Contact Candidate

Contacts

Send a connection request to the candidate to get their contact details.

Contact Candidate