Data Science Challenge

solution:

Introduction

Welcome to the HAMS Data Science Challenge! With this challenge we want to test your ability to build advanced probabilistic data models and to interpret their results.

Task

The challenge is set in the context of performance marketing and we want you to apply a bayesian mixed-media model (MMM) on our test dataset an interpret the insights from the model. Build the MMM with the latest PyMC package (https://www.pymc.io/).

Context

We have a company X which runs an online shop. X advertises on seven different paid channels and has weekly costs in them. Marketing actions have usually not an immediate effect, ads and campaigns in one week influence usually sales in the coming weeks. Hence, the company is of course super interesting to understand how effective different channels are. In terms of channels think of TV, radio, billboards, but also online advertisement such as Google Ads, Facebook Ads, etc. So different channel can be expected to target different audiences at different times, and hence will have very different effects on future sales.

This is of course the perfect setting for an ambitious Data Scientist. ;-) Modelling the uncertainty and the delayed effects is of course key. We are working heavily with Bayesian models and would like here to test your understanding and approaches in this setting.

Some hints

You will need to model the spend carry over effect (adstock).
No need (for now) to overcomplicate the adstock shape effects with saturation or diminishing returns.
Seasonality & trend might be interesting to be included in your model.

Dataset MMM_test_data.csv

start_of_week: first day of the week
revenue: revenue generated in this week from sales
spend_channel_1..7: marketing cost spend in this week in channel 1..7

Questions

How do you model spend carry over?
Explain your choice of prior inputs to the model?
How are your model results based on prior sampling vs. posterior sampling?
How good is your model performing? How you do measure it?
What are your main insights in terms of channel performance/ effects?
(Bonus) Can you derive ROI (return on investment) estimates per channel? What is the best channel in terms of ROI?

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
report		report
solution		solution
src		src
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Data Science Challenge

Introduction

Task

Context

Some hints

Dataset MMM_test_data.csv

Questions

Deliverable

Note: We don't expect you to build THE perfect analysis and report here.

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Data Science Challenge

Introduction

Task

Context

Some hints

Dataset MMM_test_data.csv

Questions

Deliverable

Note: We don't expect you to build THE perfect analysis and report here.

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages