Position: Staff Data Scientist
Job Location: 702 SW 8th Street, Bentonville, AR 72716
Duties: Data Source Identification: Supports the understanding of the priority order of requirements and service level agreements. Helps identify the most suitable source for data that is fit for purpose. Performs initial data quality checks on extracted data. Data Strategy: Understands, articulates, and applies principles of the defined strategy to routine business problems that involve a single function. Model Assessment and Validation: Identifies the model evaluation metrics. Applies best practice techniques for model testing and tuning to assess accuracy, fit, validity, and robustness for multi-stage models and model ensembles. Data Visualization: Generates appropriate graphical representations of data and model outcomes. Understands customer requirements to design appropriate data representation for multiple data sets. Works with User Experience designers and User Interface engineers as required to build front end applications. Presents to and influences the team and business audience using the appropriate data visualization frameworks and conveys clear messages through business and stakeholder understanding. Customizes communication style based on stakeholder under guidance and leverages rational arguments. Guides and mentors junior associates on story types, structures, and techniques based on context. Understanding Business Context: Provides recommendations to business stakeholders to solve complex business issues. Develops business cases for projects with a projected return on investment or cost savings. Translates business requirements into projects, activities, and tasks and aligns to overall business strategy and develops domain specific artifact. Serves as an interpreter and conduit to connect business needs with tangible solutions and results. Identifies and recommends relevant business insights pertaining to their area of work. Tech. Problem Formulation: Translates/co-owns business problems within one's discipline to data related or mathematical solutions. Identifies appropriate methods/tools to be leveraged to provide a solution for the problem. Shares use cases and gives examples to demonstrate how the method would solve the business problem. Analytical Modelling: Selects appropriate modelling techniques for complex problems with large scale, multiple structured and unstructured data sets. Selects and develops variables and features iteratively based on model responses in collaboration with the business. Conducts exploratory data analysis activities (for example, basic statistical analysis, hypothesis testing, statistical inferences) on available data. Identifies dimensions and designs of experiments and create test and learn frameworks. Interprets data to identify trends to go across future data sets. Creates continuous, online model learning along with iterative model enhancements. Develops newer techniques (for example, advanced machine learning algorithms, auto ML) by leveraging the latest trends in machine learning, artificial intelligence to train algorithms to apply models to new data sets. Guides the team on feature engineering, experimentation, and advanced modelling techniques to be used for complex problems with unstructured and multiple data sets (for example, streaming data, raw text data). Model Deployment and Scaling: Deploys models to production. Continuously logs and tracks model behavior once it is deployed against the defined metrics. Identifies model parameters which may need modifications depending on scale of deployment. Code Development and Testing: Writes code to develop the required solution and application features by determining the appropriate programming language and leveraging business, technical, and data requirements. Creates test cases to review and validate the proposed solution design. Creates proofs of concept. Tests the code using the appropriate testing approach. Deploys software to production servers. Contributes code documentation, maintain playbooks, and provide timely progress updates.
Minimum education and experience required: Master’s degree or the equivalent in Statistics, Economics, Analytics, Mathematics, Computer Science, Information Technology or related field plus 2 years of experience in analytics or related experience; OR Bachelor’s degree or the equivalent in Statistics, Economics, Analytics, Mathematics, Computer Science, Information Technology or related field plus 4 years of experience in analytics or related experience; OR 6 years of experience in analytics or related experience.
Skills required: Must have experience with: Developing and deploying machine learning models for predictive analytics, classification, clustering, and anomaly detection using machine learning algorithms like linear regression, logistic regression, support vector machines (SVM), decision trees, Random Forest, XGBoost, Generalized Additive Models and neural networks; Utilizing statistical techniques such as hypothesis testing, ANOVA, regression, and time series analysis; Utilizing Python (including libraries such as Pandas, NumPy, Scikit-learn, TensorFlow, and Keras) and NLP libraries and frameworks (NLTK, SpaCy, and Transformers); Performing data extraction, manipulation, and management using Hive, MySQL, Couchbase, MongoDB, and BigQuery; Utilizing Hadoop, Spark, and Cassandra; Developing enterprise-level solutions using object-oriented programming in Python, Java, and C++, with R and SAS for statistical analysis and modeling; Developing and implementing computer vision algorithms to enhance image quality and build and deploy OCR systems to extract text from enhanced images; Designing and implementing REST API using Java Spring Framework and Python Flask to work as backend for AngularJS and React application; Utilizing Natural Language Processing (NLP), including creating and using word embeddings (Word2Vec, GloVe, FastText); using language models like BERT, GPT, and Transformer-based models for various NLP tasks including text generation, summarization, and translation; and applying deep learning architectures such as RNNs, LSTMs, GRUs, and Transformers for complex NLP task; Building responsive and interactive user interfaces using HTML, CSS, JavaScript, React and AngularJS; Deploying web applications utilizing DevOps methodologies, Docker containers, and HTTPS server deployment on OneOps platform. Employer will accept any amount of experience with the required skills.
Wal-Mart is an Equal Opportunity Employer.
Company:
WalmartEmployee Type:
Full timeLocation:
United StatesSalary:
$ 63360 - $ 147840