This repository contains answers of the Shopify Summer 2022 Data Science Intern Challenge.

Last update: Jan 11, 2022

Overview

Data-Science-Intern-Challenge

This repository contains answers of the Shopify Summer 2022 Data Science Intern Challenge.

Summer 2022 Data Science Intern Challenge

Please complete the following questions, and provide your thought process/work. You can attach your work in a text file, link, etc. on the application page. Please ensure answers are easily visible for reviewers!

Question 1: Given some sample data, write a program to answer the following: click here to access the required data set

On Shopify, we have exactly 100 sneaker shops, and each of these shops sells only one model of shoe. We want to do some analysis of the average order value (AOV). When we look at orders data over a 30 day window, we naively calculate an AOV of $3145.13. Given that we know these shops are selling sneakers, a relatively affordable item, something seems wrong with our analysis.

Think about what could be going wrong with our calculation. Think about a better way to evaluate this data.

Answer: The wrong average was calculated using this method: total of all order values/ number of order_values. This is wrong because the formula didn't consider the fact that an order can have multiple items. I have tried to explain the problem with code. Click Here to view it.

What metric would you report for this dataset?

Answer: The correct approach would be to divide the total of all order_values by the sum of total_items. By following this method, we would consider the fact that an order can have multiple items.

What is its value?

Answer: $357.92

Question 2: For this question you’ll need to use SQL. Follow this link to access the data set required for the challenge. Please use queries to answer the following questions. Paste your queries along with your final numerical answers below.

How many orders were shipped by Speedy Express in total?

Answer: 54

What is the last name of the employee with the most orders?

Answer: Peacock

What product was ordered the most by customers in Germany?

Answer: Boston Crab Meat. This product was ordered 160 times in total.

Click here to check the sql queries.

This repository contains answers of the Shopify Summer 2022 Data Science Intern Challenge.

Related tags

Overview

Data-Science-Intern-Challenge

Summer 2022 Data Science Intern Challenge

Owner

This is the implementation of "SELF SUPERVISED REPRESENTATION LEARNING WITH DEEP CLUSTERING FOR ACOUSTIC UNIT DISCOVERY FROM RAW SPEECH" submitted to ICASSP 2022

Road Crack Detection Using Deep Learning Methods

StyleGAN2-ADA - Official PyTorch implementation

Massively parallel Monte Carlo diffusion MR simulator written in Python.

VoxHRNet - Whole Brain Segmentation with Full Volume Neural Network

scalingscattering

SEJE Pytorch implementation

Emulation and Feedback Fuzzing of Firmware with Memory Sanitization

RetinaFace: Deep Face Detection Library in TensorFlow for Python

LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT

[ICLR2021] Unlearnable Examples: Making Personal Data Unexploitable

Code for Quantifying Ignorance in Individual-Level Causal-Effect Estimates under Hidden Confounding

[AAAI22] Reliable Propagation-Correction Modulation for Video Object Segmentation

FcaNet: Frequency Channel Attention Networks

Source code for paper "ATP: AMRize Than Parse! Enhancing AMR Parsing with PseudoAMRs" @NAACL-2022

This repository contains the accompanying code for Deep Virtual Markers for Articulated 3D Shapes, ICCV'21

Additional code for Stable-baselines3 to load and upload models from the Hub.

The 1st Place Solution of the Facebook AI Image Similarity Challenge (ISC21) : Descriptor Track.

[CVPR'21] FedDG: Federated Domain Generalization on Medical Image Segmentation via Episodic Learning in Continuous Frequency Space

Imposter-detector-2022 - HackED 2022 Team 3IQ - 2022 Imposter Detector