
Sr. Data Scientist at Insight International Limited
Madrid, MAD
About the job
Data Scientist Job Description:
Qualifications for Data Scientist:
- Strong problem solving skills with an emphasis on product development.
- Experience using statistical programming languages (Python, R, SQL, etc.) to manipulate data and draw insights from large data sets for AML.
- Experience working with and creating data architectures.
- Knowledge of a variety of machine learning techniques (clustering, decision tree learning, artificial neural networks, etc.) and their real-world advantages/drawbacks.
- Knowledge of advanced statistical techniques and concepts (regression, properties of distributions, statistical tests and proper usage, etc.) and experience with applications.
- Excellent written and verbal communication skills for coordinating across teams.
- A drive to learn and master new technologies and techniques.
- We’re looking for someone who has 3-7 years of experience manipulating data sets and building statistical models, has a background in Mathematics, Computer Science or another quantitative field, and is familiar with the following software/tools:
- Coding knowledge and experience with several languages: Python, R, SQL, C, C++, Java, JavaScript, etc.
- Knowledge and experience in statistical and data mining techniques: GLM/Regression (Generalized Linear Models in Python), Random Forest, Boosting, Trees, Text Mining, SNA (Social Network Analysis), etc.
- Experience in data analysis (assessing data quality, identifying what data is missing and how it can be obtained from internal or external sources) in the AML domain.
- Experience querying databases and using statistical programming languages: Python, SQL, etc.
- Experience using API/MS, web services: S3, Spark, Redshift, etc.
- Experience creating and using advanced machine learning algorithms and statistics: Regression, Simulation, Scenario Analysis, Modeling, Clustering, Decision Trees, Neural Networks, etc.
- Experience analyzing data from 3rd party providers: Factiva, Google Analytics, Site Catalyst, Coremetrics, Adwords, Crimson Hexagon, Facebook Insights, etc.
- Experience with distributed data/computing tools: Hadoop, Hive, Spark, MySQL, Cloud, Docker, Kubernetes, GitHub, etc.