Data Science at Companies
Why does Airbnb have a data scientist on every team? What did it take to build out Thumbtack's data infrastructure? How do Twitch data scientists convince execs to embrace data-informed decision making? Get behind-the-scenes perspectives on how data teams at actual companies tackle questions of building infrastructure, scaling analytics, attaining buy-in, and structuring teams.
Do Data-Driven Companies Actually Win?
Some gut-based judgements on the effectiveness of data.-benn.substack
Eight Archetypes of Data Citizens
The question for data teams is: what sort of attitudes do you want to cultivate?-Data People Etc.
Are We Human, or Are We Vendor?
We can talk about the tools, we can talk about the work, or we can take the third option.-benn.substack
Building More Effective Data Teams Using the JTBD Framework
The way your team drives impact to the business is through improved decision-making- both for your company and your customers. But it can be hard to take that mission and put it into practice.-Locally Optimistic
The Modern Data Stack Guide for 2022
We’re not here to throw more lingo in this ever-growing pot of modern data stack gumbo...but we do want to help you understand what the modern data stack is, how it works, and its benefits for companies of different sizes.-Mode Blog
Why Do People Want to Be Analytics Engineers?
The job nobody wanted is now the job we can’t get enough of.-benn.substack
Don’t Just Run Your Data Team Like a Product Team, Run It Like a Company That Needs to Scale
A minor change in framing.-The Data Leader's Survival Guide
iSnowflake
It’s not the App Store. It’s the iPhone.-benn.substack
Awesome Data Leadership
A curated list of posts, videos, and articles on leading a data team should be useful for leadership at the middle-management, Director/VP, or C-suite level, for organizations both big and small.-Roni Kobrosly
Stakeholders: The Most Important Relationship for Analysts
Usually there are one or two really important stakeholders in an analyst’s work. You know the ones. Here are some do’s and don’ts for fostering a great relationship.-Inside Data by Mikkel Dengsøe
Microsoft, Google, and the Original Purple People
Google has the products, technology, and capital to dominate the data industry. Why aren’t they?-benn.substack
All Your Database Are Belong to Us
The machines are coming for our jobs, and…they might actually help? Plus, free advice for Steve Kerr and Ime Udoka.-benn.substack
Accidentally Trapping Ourselves With Stats
The insignificance of significance.-Counting Stuff
Using Data to Navigate: Data Leaders On Leading Companies Through Economic Uncertainty
In times of uncertainty, we search for answers—and data holds a lot of answers.-Mode Blog
The Technical Pay Gap
The culture we build is the culture we buy.-benn.substack
Locally Optimal
How much value is really created by analyzing data?-Casual Engineering
The Best Decision Is One
As data folk, we’re not here to help organizations make better decisions; we’re here to create better results.-benn.substack
The Case for Consolidation
The data world should pay more attention to Microsoft.-benn.substack
Hiring Data Scientists With Intention
Focus on these three areas to more inclusively hire data scientists.-The Data Leader’s Survival Guide
Has SQL Gone Too Far?
The case for better business models.-benn.substack
Data Salaries at FAANG Companies in 2022
Salary is a sensitive topic. This post shines some light on data compensation using—you guessed it—data.-Mikkel Dengsøe
The End of Big Data
Databricks, Snowflake, and the end of an overhyped era.-benn.substack
Thinking About Evolving Your Data Stack
When you’re trying to decide which data tools to use, start not with the data sources, but with the business outcomes you want to achieve.-The Data Leader’s Survival Guide
Startups Shouldn’t Care About Revenue
And data teams should make sure they don’t.-benn.substack
Work Like an Analyst
The divisions of labor between analysts and everyone else are fading.-benn.substack
Data Team Branding
“You may not think about it consciously every day, but your hiring practices, promotion cycles, and ultimately your team’s success are all influenced by how you ideally want the rest of the company to perceive your team.”-The Data Leader’s Survival Guide
Rent the Runway’s First Fix-It Week
While their engineering counterparts set aside more time in every sprint to tackle tech debt, Rent the Runway’s data team opted to wrangle their issues in one fell swoop.-Dress Code
The Economics of Data Businesses
How data businesses start, and how they keep going, and growing, and growing.-Pivotal
Service Pressure
Data teams aren't service organizations, but we can learn a lot from those that are.-benn.substack
Planning Quantitative Work
When someone says “I need this analysis by next Friday,” what do you do?-Counting Stuff
Delegation is a Superpower
“Delegation is necessary soil for growth – for your own growth, for those around you, and for your organization. To grow into a new phase, you need to evolve your established responsibilities and make space for new ones.”-LeadDev
Delirium
We talk a lot about what data can borrow from software engineering. But on the biggest issue of all, the technology industry has a lot to learn from the analytics community.-benn.substack
Estimating the Geographic Area of a Real Estate Agent
This in-depth post shows how Compass uses Kernal Density Estimation to calculate where real estate agents work, so that they can locate the right agent for their prospective home buyers.-AI@Compass
Data is for Dashboards
Defining metrics and creating dashboards isn’t banal busywork that gets in the way of “the more rewarding aspects” of analysts’ jobs. It’s one of the most important things that we do.-benn.substack
Why So Many Data Scientists Quit Good Jobs at Great Companies
A look at why the "sexiest job of the 21st century" has lost its appeal.-Start It Up
Does Data Make Us Cowards?
There’s a thin line between being analytical and being afraid.-benn.substack
The Missing Analytics Executive
We should redefine the role of the chief data officer. For our companies, for our careers, and for ourselves.-benn.substack
A Method for Measuring Analytical Work
Our only job should be to make people more decisive.-benn.substack
What Data Folk Were Saying About Zillow
In early November, Zillow announced it was winding down its iBuying business because of massive losses. This prompted many discussions on Twitter about the data science team’s role in iBuying’s failure and what machine learning is good and not good for in the industry.-Counting Stuff
Data and the Almighty Dollar
The data ecosystem is booming. The data economy has some things to figure out.-benn.substack
Universal Holdout Groups at Disney Streaming
“The idea of a universal holdout group is to hold back a randomly-sampled small percentage of users from all product changes for a period of time. This allows us to then compare metrics between the users who receive the production experience and those users who are held back from any product changes.”-Disney Streaming
Our Take On Prescriptive Analytics and How We Used it in 2020
Being able to quickly leverage data via prescriptive analytics gives businesses the advantage needed to tackle any unprecedented uncertainty, whether it’s a pandemic, a sudden decline in membership, or a surprise uptick in product demand.-Mode Blog
The Future of Operational Analytics
A better way to make daily decisions is already here—just not at work.-benn.substack
BI is Dead
How an integration between Looker and Tableau fundamentally alters the data landscape.-benn.substack
Chasing Ghosts
The data community talks a lot about tools, but what about the work of analysis itself? Because most corporate analyses are proprietary, it’s hard to learn from others’ work and improve.-benn.substack
The Intergalactic Data Stack
Falling further into the BI black hole—and charting a course out.-benn.substack
How I Got a Job at DeepMind as a Research Engineer (without a Machine Learning Degree!)
Lots of good insight in here for structuring your own ML learning curriculum, how the DeepMind hiring pipeline is structured, and how to get referrals.-Aleksa Gordić
Who Is “The Community?”
Even though the analytics community is generally thought of more welcoming and inclusive than other tech circles, Black folks remain woefully underrepresented. And we need to start talking about it.-benn.substack
Is BI Dead?
By defining BI as just self-serve, we shortcut what it could be, no matter how good that self-serve is. A better, more universal BI tool would combine both ad hoc and self-serve workflows, making it easy to hop between different modes of consumption.-benn.substack
Why Data Scientists Shouldn’t Need to Know Kubernetes
Where does the “full-stack” data scientist expectation come from? And how might data scientists own the stack, with the help of a good infrastructure abstraction tool?-Chip Huyen
Quasi-mystical Arts of Data & the Modern Data Experience
If we should think of data as a product, then we should think of data as a software product.-The Analytics Engineering Roundup
The Data OS
In 2021, Y Combinator funded 100 data companies. While some folks say there are “too many data tools,” instead of calling for a culling of said tools, we should try to figure out how to better manage the tools we have, and the shiny future ones we might want.-benn.substack
The Modern Data Experience
The modern data stack isn't enough. We have to create a modern data experience.-benn.substack
The Unspoken Gerrymandering of the Modern Data Stack
The modern data stack doesn’t need a BI bucket and a data science bucket; it needs a unified consumption layer. To do our job well, we have to overcome the technical division, not be defined by it.-benn.substack
The Third Rail
Analysts are like honeydew, victims of the soft bigotry of low expectations.-benn.substack
How To Build An In-House Data Team
Which comes first—the data engineer or the data analyst? And what’s the right team structure?-Forbes
Building a Data Team at a Mid-stage Startup: A Short Story
Gather round, folks, and hear a tale of fragmented data and unclear expectations, of organizational challenges and data-demotivated culture.-Erik Bernhardsson
Self-serve Is a Feeling
As data teams, we chase the self-serve experience that we think we’re supposed to build. We should be more critical of that, and chase the self-serve experience that makes us and our customers feel most at home.-benn.substack
Analytics Is at a Crossroads
The world is full of great analysts. Will we have the courage to go looking for them?-benn.substack
Data's Big Whiff
How to escape our dashboard rat race, learn from data, and love the job again.-benn.substack
What the Heck is a Data Mesh?!
“What we need... is for organizations to treat data with the same care that they treat any other public facing API.”-Chris Riccomini
When Are Templates Going to Happen?
The core assumption behind templates is that everyone's data is mostly the same, and everyone's business problems are mostly the same. That's only half true.-benn.substack
DataHub: Popular Metadata Architectures Explained
It doesn’t matter how cost-efficient or scaleable your data warehouse is if your stakeholders are wasting time trying to find the right dataset to perform analysis.-LinkedIn Engineering
The Self-serve Shibboleth
“The problem self-serve is meant to solve—that there are more questions than analysts, and we don’t have time to answer them all—is still a problem, even if self-serve is a lousy solution.”-benn.substack
Analytics Engineering Everywhere
While we were being dazzled by data scientists, another role started to emerge, one that has a lot of impact for smaller organizations where data science often falls flat.-Jason Ganz
How Product Leaders Can Make Better Decisions With Iterative Analytics
This post walks through the process of answering a question every product leader has had: “Should the product team spend time fixing bugs or building new features?”-Mind the Product
Analytics Is a Mess
And a mess isn’t a bad thing. Data isn’t objective, and analysis isn’t structured. It’s creative and disorganized. So don’t stifle it; just have a plan for tidying up later.-benn.substack
Run Your Data Team Like A Product Team
“When a data team emerges in a company as an afterthought, they often end up being built like service-based departments with a “submit a ticket with a question, get a very specific answer” mindset. Data folks who are bound to this model rarely spend time being proactive. Without intentional space, they are unlikely to be anything more than ticket closers.”-Locally Optimistic
The Most Important Role You’re Not Hiring for Your Data Team: The Information Architect
When you think of information architecture, your first thought might be “websites.” But it’s just as important for designing dashboards and data viz.-Nightingale
Can College Predictive Models Survive the Pandemic?
“The last year has revealed just how important it is that we fully understand the ‘how’ and the ‘why’ of the predictions these tools make about ‘who’ is most likely to enroll or may need additional services to help them succeed at an institution.-EdSurge
Why Is Self-serve Still a Problem?
“Analysts and non-analysts use data in structurally different ways. By conceptualizing self-serve BI as a simplified means for doing an analyst’s job, we’re not only making the self-serve problem too hard—we’re solving the wrong thing altogether.”-benn.substack
Building Powerful Data Teams: On Investing in Junior Talent
“As a Data Director and manager of a 5-person data team, I'm obsessed with how we can more efficiently, and with a lot less shame, go through that cycle of learning from mistakes.-Brittany Bennett
Uber’s Journey Toward Better Data Culture From First Principles
It’s not all about scale.-Uber Engineering Blog
Data Science at an Analytics SaaS Startup
Working at a SaaS company was a new challenge for Mode’s Director of Data Science. Here are five key lessons from his past roles in P2P commerce, traditional commerce, gaming, and ad tech that he puts applies to SaaS.-Towards Data Science
Four Steps To Make Your Organization More Data-Proficient In The Coming Decade
Everyone—data scientists, marketing managers, CEOs, all—will need to deepen their data skills.-Forbes
Spatial Information and the Legibility of Urban Form: Big Data in Urban Morphology
An interesting look at how data science can be used in urban planning.-International Journal of Information Management
Growing and Running Your Data Science Team
“An effective data science team is like the goose that lays the golden eggs. Sometimes, we focus too much on the eggs (i.e., results)—how can we get more, faster? As a result, the goose (i.e., team) can get neglected.”-Eugene Yan
One Woman’s Work Using Data Science to Break Down Barriers – and Breaking Down Barriers to Data Science
This profile of Kim Eng Ky highlights examples of how data science can be used for improved social outcomes in both the private and public sector.-The Good AI
Data Meta-Metrics
How do we communicate confidences and doubts about data to a non-technical audience?-Haystacks
(At Least) 5 Ways Data Analysis Improves Product Development
Traditional BI tools are great at answering static questions, but not the big-picture questions that direct decisions on what big swings to take in your product roadmap.-Mode Blog
Designing Data Science Tools at Spotify
A product designer explains her process for creating usable, clean tools to help data scientists make sense of Spotify’s millions of users.-Spotify Design
Building Diverse Data Teams
These workshop slides present real strategies for creating inclusion and equity in your hiring process. One way to start? Throw out your referral program and only incentivize referrals for under-represented minorities.-Ayodele Odubela
State of Data Science and Machine Learning 2020
Cloud computing is on the rise. Scikit-learn is the most popular machine learning tool. And data scientists are still overwhelming young and male.-Kaggle
Election Night with Biden’s Data Guru
This is a fascinating look into the world of political data and analytics. Chief Analytics Officer Becca Siegel was a critical part of Biden’s campaign, raising early alarm bells about unrealistically friendly polling data and pushing for more appearances in Georgia during the final stretch.-Intelligencer
Lessons Data Teams Can Learn From the 2020 Election Polls
The question of whether we should produce political polls is akin to asking whether we should use be using imperfect data to inform business decisions.-Mode Blog
When Good Data Analyses Fail to Deliver the Results You Expect
A cautionary tale of a data analysis, a dashboard, and a huge waste of resources.-Narrator
What’s It Like to Be an Applied Scientist at Amazon?
What’s an average day (or non-average day) look like building machine learning systems to help customers?-Career Fair
An Intake Form for Data Requests
When a newly formed data team started getting data requests from all over the company via Slack, email, meetings, and more, they decided to centralize calls for help with a form. The info they receive sets them up for great kickoff conversations with stakeholders.-Haystacks
What if You Were an Evil Data Scientist?
Generally, data folks are well meaning, but what if they weren’t? This thought experiment is good reminder of how much power data scientists and analysts wield and the responsibility that comes with it.-Counting Stuff
The Hardware Lottery
Does this sound familiar? A research idea wins because it works well with the available software and hardware and not because the idea is better than other research directions. Then you, my friend, have been playing the hardware lottery.-Sara Hooker
Don’t Tell Your Data Team’s ROI Story
“The truth is that if you’re trying to quantify your impact by yourself, you have already lost. Instead, the best way to tell the ROI story is for other people to tell it.”-Hex
Unpopular Opinion - Data Scientists Should Be More End-to-End
While this is frowned upon (too generalist!), I've seen it lead to more context, faster iteration, greater innovation—more value, faster.-Eugene Yan
Are Dashboards Dead?
“If you’ve been using the modern analytics stack for multiple years now, you’re likely starting to feel some pain. It’s probably starting to feel a little bit like the ‘wild west’ to you.”-dbt Blog
Monitoring Data Quality at Scale with Statistical Modeling
Based on historical data patterns, Uber’s Data Quality Monitor automatically locates the most destructive anomalies and alerts data table owners to check the source, but without flagging so many errors that owners become overwhelmed.-Uber Engineering
How Do Data Scientists and Machine Learning Engineers Work With One Another?
This thread offers a look into the wide range of ways companies structure their machine learning and data science organizations.-r/datascience
What I Love about Scrum for Data Science
“Despite my initial concerns (and violent objections), Scrum grew on me. Now, I find it almost indispensable.”-Eugene Yan
Does Stress Impact Technical Interview Performance?
These findings show that traditional technical interviews assess performance anxiety, not software skills. That means companies are missing out on hiring talented employees just because those programmers aren’t good at writing on a whiteboard and explaining their work out loud while coding.-NC State University
Data Analyst 3.0: The Next Evolution of Data Workflows
With the rise of cloud-native data warehouses and advancements in scalable inference methods, we’re now at the cusp of a third phase that not only affords better, faster processing of data, but also lets operational data analysts impact business decisions like never before.-Sisu
How Five Data Leaders Are Leading Teams Through the Pandemic
Starting with the transition of working from home, leaders from Lyft, Intercom, Greenhouse, Sisu, and Patreon walked us through stories of pivots, rapid user growth, new metric adoption, and managing competing priorities.-Mode Blog
COVID-19 Cases Are Rising, So Why Are Deaths Flatlining?
Some possible explanations for the case-death gap? Deaths lag behind cases. The typical COVID-19 patient is getting younger. Expanded testing means finding more cases, milder cases, and earlier cases.-The Atlantic
N=1: My Experience with Motherhood in Tech
“I would have loved to hear about this journey from a person similar to me—a woman in tech who cares deeply about her career and community as well as her family. Influencers can be great, but sometimes you just want to know how a regular person told their boss they were pregnant, and maybe what stroller they picked and why.”-Haystacks
10 Tips for Research and a PhD
“This advice is not just limited to PhD students. If you are an independent researcher, want to start a PhD in the future or simply want to learn, then you will find most of this advice applicable.”-Sebastian Ruder
From F2F to Remote
A stats professor shares how she adapted her undergraduate regression course when the class switched to remote learning mid-semester.-Maria Tackett
How I Wound Up Being a Quantitative UX Researcher
Quantitative UX researchers and data analysts share a similar base skill set. The main difference is how closely UX researchers work with the product team.-Counting Stuff
Reassessing Customer Health Scores in a Fluctuating Market
Within only a week or two of closing our office, we had a fully explorable, dynamic health scoring model leveraged across key stakeholders of Mode.-Mode Blog
Proxy Metrics
The former VP of Product Management at Netflix walks through how to define a product metric to prove or disprove your hypotheses and measure progress.-Gibson Biddle
Building Data Science Infrastructure
Super-practical, step-by-step advice on setting up the very first data science infrastructure and team in an organization.-Data Science Salon
An Analyst’s Perspective on COVID-19’s Economic Impact on Businesses
In a previous life, Mode's co-founder and president worked as an economic analyst. Here's his assessment of how businesses can brace for strong economic headwinds ahead.-Mode Blog
Cultivating Algorithms: How We Grow Data Science at Stitch Fix
In this interaction visualization, Stitch Fix presents the organizational structure, roles, and processes that define their way of working, and how it is different from the way data science operates at other companies.-MultiThreaded
How Mode Went Completely Remote in 36 Hours, and 7 Tips We Learned Along the Way
The combination of unified activity monitoring, digital accountability, and virtual togetherness processes allowed this largely non-remote group to transition eerily seamlessly into a remote workforce.-Mode Blog
How We Improved Data Discovery for Data Scientists at Spotify
What Spotify’s insights team learned as they iterated on an internal tool for sharing data and knowledge with employees.-Spotify Labs
The New Business of AI (and How It’s Different From Traditional Software)
“Many AI companies (and investors) are betting that this relationship will extend beyond just technology–that AI businesses will resemble traditional software companies as well. Based on our experience working with AI companies, we’re not so sure.”-Andreessen Horowitz
Lessons Learned Managing the Gitlab Data Team
Don't reinvent the wheel, plan for growth, and pick excellent tools.-Gitlab Blog
10x Data Scientist is Luckily Not "a Thing" Let's All Work to Keep It This Way
The 10x engineer is a toxic stereotype in the first place. It’s an especially unproductive way to think about data science.-Counting Stuff (https://counting.substack.com/)
Bayesian Product Ranking at Wayfair
With 25,702 shower curtains in their store, how does Wayfair pick the right three to recommend to a customer?-Wayfair Technology Blog
Doing Freelance Data Science Consulting in 2019
First tip: call yourself a consultant instead of a freelancer or contractor—it sounds fancier.-Ethan Rosenthal
A Dispassionate Examination of the Empirical Evidence Regarding Positional Punctuation in SQL
Let’s put the leading vs trailing comma debate to bed, once and for all. (But c’mon, trailing commas are obviously way better, right?)-Mode Blog
How to Solve a Business Problem Using Data
Reflect back on your grade school days... It turns out solving a business problem with data requires the same foundational skills as solving a word problem in grade school math class.-Little Miss Data
Data Project Checklist
Use this questionnaire to understand an organization’s context for developing data projects. It’s based on decades of projects across many industries, including agriculture, mining, banking, brewing, telecoms, and retail.-fast.ai
How to Demystify Skewed Data and Deliver Analysis
Try out these four data distribution charts to spot meaningful patterns, trends, or significant errors in your data.-Mode Blog
Want To Make Good Business Decisions? Learn Causality
This causality walkthrough uses an example that plagues many modern marketing teams: “Should we pay for branded search ads?”-MultiThreaded
dbplyr: A Path to More Inclusive Data Transformations at the ACLU
By using the Tidyverse’s dplyr to generate complex SQL code in their data warehouse, the ACLU analytics team is able to create reusable data workflows that everyone can benefit from—whether they’re using SQL, Python, R, or a BI tool.-ACLU Tech & Analytics
Tips for Data Science Interviews
A lot of the advice boils down to: put yourself in the shoes of the interviewer.-Becoming a Data Scientist
When Did Analytics Engineering Become a Thing? And Why?
Read about the market trends that gave rise to the newest role on modern data teams.-dbt Blog
How to Find Consulting Clients
If you’re thinking of striking out on your own as a data science consultant, your biggest fear is probably “How will I find work?”-Chris Achard
Choose Boring Technology
Adding technology to your company comes with a cost. How would you solve your immediate problem without adding anything new?-Dan McKinley
Coding Salaries in 2019: Updating the Stack Overflow Salary Calculator
It turns out DevOps engineers generally love their jobs and they make bank.-Stack Overflow
Modeling Conversion Rates and Saving Millions of Dollars Using Kaplan-Meier and Gamma Distributions
“Computing a conversion rate is often fairly straightforward and involves nothing more than dividing two numbers... [But] when there is a substantial delay until the conversion event, this analysis suddenly gets vastly more complex.”-better.engineering
Logs Were Our Lifeblood. Now They're Our Liability.
“If the first ten years of data science were all about collecting and analyzing everything, the second ten will be about how to be deliberate and selective about collected and analyzed data.”-Normcore Tech
How the Data Insights Team Helps Flatiron Build Useful Data Products
A look at how the first data hire at a healthcare startup scaled up to a group of Data Insights Engineers who are embedded with almost every product team.-Carlos Aguilar
Need Some Fashion Advice? Just Ask the Algorithm
Stitch Fix is using data to demystify one of the most elusive parts of fashion: how individual pieces of clothing relate to one another.-Wired
How Lyft Creates Hyper-Accurate Maps from Open-Source Maps and Real-Time Data
Lyft has fixed thousands of map errors in OpenStreetMap (OSM) in bustling urban areas.-Lyft Engineering
Selling Data Science
“Half of your job, regardless of what that job is, is being able to sell your work.-Normcore Tech
Devoted Health and Data Science with Chris Albon
Centralized data science team or matrixed? How do you choose whom to hire? How do you go from a Political Science Ph.D. to non-profit data science wizard?-Google Cloud Platform Podcast
The Care and Feeding of Data Scientists
Are you a data scientist manager who had the role thrust upon you? Feel like you don’t have the necessary toolset or role models or mentorship to do the job well? This e-book aims to fill that gap.-O’Reilly
The UX of Data
Generating data is easy. Making sure it’s reliable and widely accessible is hard. With a bit of human-centric thinking, teams can build shared understanding of why, what, and how to measure their work.-Lex Roman
Presto Infrastructure at Lyft
Learn about the query engine that supports thousands of dashboards, 1.5K weekly active users, and a couple million queries every month.-Lyft Engineering
Building Our Centralized Experimental Platform
Running an A/B test is easy. Screwing up an A/B test is even easier. Stitch Fix built their own tool to make tests consistent across teams and give visibility into every test being run in the company.-MultiThreaded
Models for Integrating Data Science Teams Within Organizations
Each model has its benefits and drawbacks. Take a gander and see what might best fit your needs.-Medium
What is Your “Data Science Team” Called in Your Organization?
A fun Twitter thread to peruse! There’s more variety in team names than you might expect.-Chris Riccomini
Models and Microservices Should Be Running on the Same Continuous Delivery Stack
The deployment needs for models and services are strikingly similar, so why are so few companies actually using a unified model and service deployment stack?-Chris Riccomini
Building Lyft’s Marketing Automation Platform
Lyft built an ML system that automatically finds the best potential users to target, allocates the right budget for each ad, and sets the right amount to bid on each platform to maximize that budget.-Lyft Engineering
You Probably Don't Need a Data Dictionary
Data dictionaries and data catalogs can end up being a large maintenance burden for little value and go out of date very quickly. Try these other approaches instead.-Locally Optimistic
Is it a good idea for a data scientist to take a position as a data engineer for a few years to build their software engineering skills?
Check out the replies in this thread if you're a data scientist curious about picking up data engineering skills.-Caitlin Hudon
The Agony and Ecstasy of Building with Data
This post on the pitfalls of A/B tests and data was written 6 years ago, but still holds up perfectly.-The Year of the Looking Glass
The Data Science Mindset: Six Principles to Build Healthy Data-Driven Organizations
Do you want to make your organization more data-driven, but you're not sure where to start? Whether you're a business or technical leader, this detailed framework provides a lifecycle to structure the development of your data science projects.-InfoQ
Real Scientists Make Their Own Data
“Your best chance to make a serious contribution as a business or academic researcher is to find, make, and combine novel data.”-Sean J. Taylor
4 Lessons to Keep Your Data Model Under Control
As data sources are added and complexity grows, the data model makes a big impact on the productivity and efficiency of anyone in the company that works with data.-Mode
A Culture of Partnership
“An analytics team that operates in a silo, despite being filled with super talented analysts, engineers, and data scientists will often end up spinning its wheels and failing to impact the bottom-line of the organization.”-Locally Optimistic
Beware the data science pin factory: The power of the full-stack data science generalist and the perils of division of labor through function
This division of labor by function is so ingrained, we're quick to organize our data science teams accordingly: one person to source the data, one to model it, one to implement, etc. But when your product is still evolving and the goal is to learn, a generalist approach is the way to go.-MultiThreaded
The AI Diet
“Only recently, with the ability to analyze large data sets using artificial intelligence, have we learned how simplistic and naïve the assumption of a universal diet is. It is both biologically and physiologically implausible... A good diet, it turns out, has to be individualized.”-The New York Times
Empathy and the Art of Analytical Persuasion
Even the most sophisticated analysis is no match for an apathetic audience. Over years of practice, we've developed a framework for impactful, empathetic presentation of data.-Mode
Git Your SQL Together (with a Query Library)
You will always need that query again. That's why you should set up a git repository for saving and sharing queries, and track any changes made over time.-Haystacks
How We Created a Visual Search Engine for Hayneedle
This post is an incredibly detailed and excellent reference for anyone considering building visual search for their product.-The Jet Tech Blog
The Problem With Hands-Off Analytics
As data scientists, we should be clear-eyed about our primary responsibility: to turn data into insight, and insight into action. Attempting to hand off these problems doesn't change that. Providing a tool to someone to get the data doesn't offload that responsibility.-Mode
The Analytics Engineer
In the shifting landscape of analytics, another role rises: someone whose job is to build tools and infrastructure to support the efforts of the analytics and data team as a whole.-Locally Optimistic
You're Only 30 Minutes From Answers
How do you ship value early when you have to build infrastructure first? Today, it takes hours—not months—to get an entire data infrastructure up and running thanks to off-the-shelf tools.-Mode
The ‘Moneyball’ Solution for Higher Education
4 out of 10 students who start college and don’t finish in at least six years. Georgia State's predictive analytics system monitors 800 different academic risk factors and identifies students who need support to graduate on time.-Politico
Overplanned Analytics Initiatives Are Doomed to Fail
The road to changing the fundamental decision-making culture of an organization always has twists and turns. So what separates the ones who navigate them successfully from those that don't?-Mode
Creating a Data Road Map
Since data often works cross functionally with other teams, it’s critical to consider other team’s priorities and objectives in developing your road map. Here’s a blueprint to get started.-Locally Optimistic
What Great Data Analysts Do — and Why Every Organization Needs Them
“Far from being a lesser version of the other data science breeds, good analysts are a prerequisite for effectiveness in your data endeavors.”-Harvard Business Review
Mo models, mo problems: tracking the quality of models in production
Caitlin Hudon asked “Data scientists: What’s your team’s approach to tracking the quality of models in production?” and got a load of great advice. Check out the original thread or the summary in this doc.-Caitlin Hudon
The Netflix Data War
This take on the “war” between the Netflix’s data team and content team examines what happens when the data says one thing, but other business factors need to be taken into consideration.-Simply Statistics
Bye bye Mongo, Hello Postgres
Why and how The Guardian switched off the Mongo DB cluster used to store their content after completing a migration to PostgreSQL on Amazon RDS.-The Guardian
Data Science vs Engineering: Tension Points
Want to get a good understanding of how data science turns into data products? This article is for you.-Domino
Predicting the Real-time Availability of 200 Million Grocery Items
Ever wished there was a way to know if your favorite Ben and Jerry’s ice cream flavor is currently available in a grocery store near you? Instacart’s machine learning team has built tools to figure that out.-Engineering at Instacart
Convoys: a Library for Modeling Time-lagged Conversion Rates
When it takes a while for someone to convert, predicting conversions gets tricky. You know who converted, but if someone did not convert, they might still convert in the future. Survival analysis to the rescue!-Better
Building Data Dictionaries
Cataloging all the data your company collects is a massive undertaking. Get started with these examples and best practices. (And remember, one step at a time!)-Haystacks
Don't Choose Dashboards Over Analysis
An analyst's real contribution doesn't come from delivering dashboards—it comes from solving problems. And often, the choice ends up being mutually exclusive. Learn how the best companies we know stay focused on analysis over dashboards.-Mode
Your Client Engagement Program Isn't Doing What You Think It Is.
Stitchfix deploys a “contextual bandit” framework, which uses algorithms to learn the most effective strategy for engaging each individual client.-MultiThreaded
Bridging the Gap Between Business Analytics & DevOps
How the analytics team at BetterCloud has worked in harmony with their DevOps team to ensure they're accessing the data they need without sacrificing security.-BetterCloud
An Introduction to the Data Product Management Landscape
Because of data’s new-found role as a key product and competitive advantage, data product management has emerged as a new career path. These roles run the gamut from infrastructure to analytics to applied AI and machine learning.-Insight Data Science
What is Production?
“While there are good reasons to be careful whenever you make changes that could impact customers, I believe that as software becomes more data-driven it is critical to find safe ways to empower Analytics teams to build and deploy data-driven applications.”-Locally Optimistic
From Data to Action With Airbnb Plus
A peek into the lives of Airbnb Data Science interns. Particularly interesting is the explanation of Airbnb's three data science tracks: Analytics, Algorithms, and Inference.-Airbnb Engineering & Data Science
The Role of an Analyst in the Age of the Citizen Data Scientist
What do analysts do when the people to whom they have been providing data begin to get that data for themselves?-Mode
Experimentation & Measurement for Search Engine Optimization
When the Airbnb product team wanted to test new landing pages to boost SEO, they realized that a traditional A/B test wouldn't cut it. Here's the “market-level” framework they implemented to measure how the new pages affected traffic.-Airbnb Engineer & Data Science
deon: An ethics checklist for data scientists
This command line tool takes the concept of data science ethics from theoretical to practical by allowing you to easily add an ethics checklist to your projects.-DrivenData
3 Analytics Leaders on Building Efficient Teams
How do you tell if your analytics team is efficient? Making an impact? If you're steering the analytical ship at your organization, you'll want to give this a read.-Mode
The Big Four Reasons Companies Struggle to Hire Data Talent
We hear from both sides of the data talent market from the thousands of data scientists, analysts and others who use Mode every day. Here are four common problems we’ve noticed companies face when hiring for data talent, and how you might fix them.-Mode
Doing good data science
“Moving fast and breaking things is unacceptable if we don’t think about the things we are likely to break. And we need the space to do that thinking: space in project schedules, and space to tell management that a product needs to be rethought.”-O’Reilly
Apple is rebuilding Maps from the ground up
Google Maps’s moat may dry up soon. Apple Maps is moving from relying on third-party data to owning all of the data that goes into making a map, without compromising their firm stance on protecting user privacy.-TechCrunch
3 Weird Analytical Practices at Mode
We offer a peek into Mode’s internal analytics culture, including the fact that the most technically proficient folks actually sit on our Customers Success team.-Mode
Reporting is a Gateway Drug
Some advice on how to use reporting as a means to create strong stakeholder relationships in your organization.-Locally Optimistic
Data Violence and How Bad Engineering Choices Can Damage Society
“If you have the temerity to insert your work into a political issue that… doesn’t immediately affect your life, you should also be prepared to accept the consequences—or, at the very least, answer a few hard questions.”-Medium
How to Attract Top Data Science Talent: Lessons from 1,400+ Insight Fellows
The head of Insight Data Science shares three key areas for differentiation to focus on throughout the hiring process.-Mode
How Grubhub Analyzed 4,000 Dishes to Predict Your next Order
Grubhub had 14 million menu items and the only thing they had in common was that sometimes people ate them. Here’s how their data team built their own taxonomy of food.-Wired
Use These Data Analytics Tips to Find Your Film’s Audience
The film industry is years behind others—like interactive and music—with regards to access to data. This post breaks down how filmmakers can collect data at all times throughout their art-making to build audiences and maximize revenue.-Sundance Institute
The (Data Science) Notebook: A Love Story
Speaking of which… what’s driving the rapid adoption of the notebook interface as the preferred environment for data science work?-Mode
Data-Driven or Insights-Driven? Data Analytics vs Data Science
This is a great example of how to handle objections that come up in conversations with folks who aren’t entrenched in data all day.-Jen Stirrup
8 Great In-App Analytics Pages in B2B Software
You probably have data that can help your users do their jobs better and frame your business in a positive light. Here are some great examples of companies providing valuable usage data to their customers through in-app analytics.-Mode
How to Create a Great In-App Analytics Page
If you’re kicking off a project to build your own in-app analytics, keep these six considerations in mind.-Mode
Scaling Event Tables with Redshift Spectrum
As Mode’s customer base grew, we reached a point where our infrastructure wasn’t capable of handling the exponentially increasing volume of event data. Here’s how we saved Redshift performance by offloading 75% of our event data to S3 in less than a week.-Mode
So here’s my postmortem after hunting for a data science job
“I’m tired of all the Medium thought pieces on how to become a data scientist because they don’t reflect the reality of getting a relevant job from the applicant’s side. And it’s hard, especially without a Masters/PhD.”-Max Woolf
How Mode’s User Stats Page Answers Customer Questions While Increasing Feature Use
Looking for a way to highlight underused features? Are customers hounding you about the value they’re getting out of your product? A usage page might just kill those two birds with one stone.-Mode
How Quora’s Head of Data Science Conducts Candidate Interviews
Anyone who has considered leaving the ivory tower should give this a read.-Mode
Transitioning From Academia to Industry: Perspectives from Indeed’s Data Scientists
Eric Mayefsky has assessed hundreds of job candidates in his half decade in management at various tech companies. Here are the five key lessons that have helped him build an amazing data team.-Indeed Data Science
Recommended companies for early-career data scientists
This is a goldmine for junior data scientists looking for companies with defined career paths and mentorship opportunities.-Hilary Mason
One Year as a Data Scientist at Stack Overflow
Lots of great reflections in here about being a data scientist at a small company, the R workflow, and working remotely. We especially love this: “Data science is highly technical work, but the value of my technical work would be much lower if I could not communicate what it means in clear and compelling ways.”-Julia Silge
Directions of Ascent
What can individuals who work with data and code do to “fix the mess that tech has enabled”?-arg min blog
Google Maps's Moat
Twitter user @msquinn put it best: “This is a great look at how the Google Maps data strategy first conceived over a decade ago continues to unfold in the product you see today.”-Justin O’Beirne
Bridging the Trust Gap: Data Misuse and Stewardship by the Numbers
Consumers don’t understand how companies actually use their data. Meanwhile, companies think more consumers understand data stewardship practices than actually do.-The Boston Consulting Group
Engaging the Ethics of Data Science in Practice
“The critical writing on data science has taken the paradoxical position of insisting that normative issues pervade all work with data while leaving unaddressed the issue of data scientists’ ethical agency. Critics need to consider how data scientists learn to think about and handle these trade-offs, while practicing data scientists need to be more forthcoming about all of the small choices that shape their decisions and systems.”-Association for Computing Machinery
Data Meta-Metrics
How do you communicate confidences and doubts about data to a non-technical audience? Check out one analyst’s method for adding a “state of the data” aspect into her presentations to get the whole team involved in the data improvement process.-Caitlin Hudon
Corporate Data Science
A great talk for those tasked with growing a data science team, covering how to optimize a team's makeup across many dimensions and instill in them the importance of caring deeply about data collection, security, ethics, and interpretability.-Angela Bassa
Communication in data science
When people stress the importance of good communication in data science, they're usually talking about communicating results—the last step in a data scientist's workflow. But communication is more than just a final bottleneck. It’s important at every stage.-University of British Columbia
Avoiding Being a 'Trophy' Data Scientist
A collection of the challenges data scientists face in their quest to add value to a company.-Peadar Coyle
How to Job Interview a Data Scientist
Mode CEO Derek Steer explains where most data scientist job interviews fall short and three key criteria for evaluating candidates.-Mode
The Four Cringe-Worthy Mistakes Too Many Startups Make with Data
HotelTonight's Chief Data and Strategy Officer shares why you should run your data team like product and think twice before hiring a data scientist.-First Round Review
Big Data Processing at Spotify: The Road to Scio (Part 1)
Using Scio, a built in-house Scala API, Spotify is able to run the majority of their workloads with a single system, with little operational overhead.-Spotify Labs
From Power Calculations to P-Values: A/B Testing at Stack Overflow
If you're a little fuzzy on the relationships between sample size, effect size, false positive, and false negative rates, this post and the accompanying interactive calculator will clear things up.-StackOverflow
A Dirty Dozen: Twelve Common Metric Interpretation Pitfalls in Online Controlled Experiments
“In our experience of running thousands of experiments with many teams across Microsoft, we observed again and again how incorrect interpretations of metric movements may lead to wrong conclusions about the experiment’s outcome, which if deployed could hurt the business by millions of dollars.”-Microsoft
ZATA: How we used Kubernetes and Google Cloud to expose our Big Data platform as a set of RESTful web services
An inside look at zulily's data platform, which makes data accessible to analysts, systems, and applications without sacrificing speed or storage options.-Tech @ zulily
Landing a Data Science Gig in New York City
Trying to break into the NYC data science job market? Sans a PhD? This guide was tailor-made for you.-Ground Truth
How Stitch Consolidates A Billion Records Per Day
Ever wanted to know how the people who make ETL tools set up their data infrastructure? Wonder no more.-StackShare
Data Science for Fraud Detection
A primer on the inner workings of fraud analysis, from dimensionality reduction to anomaly detection.-codecentric
5 Ways to Make Sure Your Analytics Spark Growth
From setting the right KPIs to sending events to the right place, here are the steps every company should take to make their product and marketing analytics worthwhile.-Astronomer.io
Data Security for Data Scientists
The Equifax breach is yet another reminder that data security is no longer a niche speciality of database admins and network engineers. Here are 10 suggestions for ensuring the data you work with is properly protected.-Andrew Therriault
Train, Score, Repeat, Watch Out! Zillow's Andrew Martin on modeling pitfalls in a dynamic world.
One of Zillow's data scientists addresses the challenges that don’t crop up in standard textbook problems or most ML competitions: feedback loops, dynamic datasets, and temporal consistency. A great read for Kagglers and non-Kagglers alike.-No Free Hunch
Switching to a Probabilistic Model for Venue Search in Foursquare
How Foursquare’s engineering team improved the accuracy and user experience of their location intelligence by switching from a search ranking algorithm to regression trees and probabilities.-Foursquare Engineering
The Data Trust Gap and How to Close It
78% of people have trouble trusting companies with their data; that means if your customers do trust you with their data, you have a competitive advantage. How do you get there? By making sure data transparency isn’t an afterthought.-InVision
Data Science without Borders
In his JupyterCon keynote, Wes McKinney makes the case for a shared infrastructure for data science, discusses the open source community’s efforts on Apache Arrow, and offers a vision for seamless computation and data sharing across languages.-O'Reilly
Choosing an ETL tool for your analytics stack
In the market for an ETL solution? Here's the criteria we employed when we evaluated ETL vendors for our own use here at Mode.-Mode
Scaling Analytical Insights with Python (Part 1)
FloSports' VP of Product shares the subscriber retention analysis that allows their business to move quickly as the number of data sources and volume of data increases.-Kevin Boller
Data Science: Challenges and Directions
Carve out some time in your schedule for this academic paper on “the gaps between the world of hidden data and existing data science immaturity.” It's a dense but rewarding read.-Communications of the ACM
Timing is everything: what our data says about the best time to send a message
Intercom analyzed millions of in-app and email messages from B2B companies to figure out when to send messages so they'll reach the maximum amount of people possible. This is a cool example of turning product analytics into actionable insights for customers.-Inside Intercom
Cargo cult data science
A “cargo cult” emerges when people copy a set of behaviors without understanding the “why” behind them. Many data science projects fail because they focus on implementing the technical side without pushing for a cultural shift to become data-driven. Here's how to avoid that pitfall.-Richard Weiss
I have data. I need insights. Where do I start?
What to do when your boss dumps a bunch of data in your lap and says “tell me something interesting.”-Towards Data Science
You Say Data, I Say System
Every spreadsheet or database view or visualization is the result of an entire system of decisions: how to collect, compute, and represent the data. This article provides an excellent framework for being mindful of the choices that shape the end product you see on your screen.-Hacker Noon
Rise of the Data Product Manager
“Working with data at the core of a product requires… a deep appreciation for what is possible and what will soon be possible by taking full advantage of the flow of data.”-Trey Causey
How To Get People to Value Your Analytics work — the Principle of Scarcity
It doesn’t matter if you deliver the best analyses in the world if you can’t persuade people to make use of those analyses.-Jon Tesser
How R Powers Data Science at Microsoft
“Every application we have today… can be made more intelligent by having a layer of R in the middle.”-insideBIGDATA
Two years as a Data Scientist at Stack Overflow
Two years in, David Robinson chronicles hiring a second data scientist, teaching R to developers, and writing production code.-Variance Explained
Getting started: the 3 stages of data infrastructure
Setting up data infrastructure no longer feels like 'trying to build a skyscraper using a toy hammer.' Nowadays there are so many options that assembling a data stack is downright daunting. Take a deep breath, and start here.-Nate Kupp, Thumbtack
Athos Changes the Game
Athos is revolutionizing sports performance with muscle activity based data. Learn how this smart startup uses Mode for fast access to business intelligence.-Mode
How to Hire a Product Analyst
6 things to look for in a product analyst + 14 questions you can use to tell a good candidate from a great one.-Amplitude
The Algorithms Behind Moana’s Gorgeously Animated Ocean
Disney engineers used a series of algorithms to simulate realistic water movement, like splashes, eddies, and wakes. The simulation output millions of new points of animation data that were smoothed into the final rendering of the film.-The Atlantic
New Leader, Trends, and Surprises in Analytics, Data Science, Machine Learning Software Poll
KDnuggets released the results of their 18th annual analytics and data science software poll. TL;DR: Python barely overtook R and Deep Learning usage surged to 32%.-KDnuggets
How Airbnb Democratizes Data Science With Data University
“In order to inform every decision with data, it wouldn’t be possible to have a data scientist in every room—we needed to scale our skillset.”-Airbnb Engineering & Data Science
Methodologies as Vanity Metrics
“When you work on learning new methods (Now I know Random Forest! Now I know K-L Divergence! Now I know Deep Learning!) it feels good—you’re exercising your brain, you know something you didn’t before—and it’s easy to think you’re progressing. But methods don’t in and of themselves drive value.”-Ian Blumenfeld
Spotting a million dollars in your AWS account
“Shooting for 80% completeness (being willing to say 'it's good enough') ended up saving us again and again from rabbit-holing into analysis that didn’t meaningfully impact our spend.”-Segment
How we learn how you learn
Duolingo leveraged their product data to create a new statistical model for effective language learning.-Making Duolingo
Data Driven Products Now!
Here's Etsy's exact process for validating new products and features with data, before starting in on the development process.-Dan McKinley
Building Data Science Teams
Should data science teams be standalone or embedded or integrated completely? How should a company conduct their data science hiring process? Instacart's VP of data science shares their secret sauce for keeping data folks happy and productive.-Jeremy Stanley
The Key to Growth? Illuminating Your Best Bets
How Airbnb and Facebook identified their North Star metrics for growth.-Startup Grind
Reliable export of Cloud Pub/Sub streams to Cloud Storage
Here's how Spotify's data infrastructure team set up an event delivery system to handle over 100 billion events generated each day.-Spotify Labs
7 Disruptive Trends to Watch For in Analytics in 2017
A wide-sweeping look at why data tools are changing to address problems like governance, collaboration, and backlogs of data requests.-Graphiti XYZ
Architecture of Giants: Data Stacks at Facebook, Netflix, Airbnb, and Pinterest
Learn how tech behemoths store and process petabytes of interactions, page views, and customer data.-Keen.io
Using dbt and Mode to Help Eko Rebuild its Analytics Stack
Learn how interactive video player Eko built a new data modeling layer and kept data consistent across internal and external reporting.-Mode
Custom Data Visualizations in the Workplace
ICYMI, here's a recap video of of presentations from data team leaders at Envoy, Good Eggs, and Thumbtack on their highest-impact work, how projects are conceived and received, and tips for identifying when to to invest in custom data viz.-Mode
How You Battle the "Data Wheel of Death" in Growth
“Most companies that want to get more serious about data approach it as a project. Something with a definitive start and a definitive end. In reality, data is an ongoing, never-ending project, similar to building a product.”-Coelevate
Data Minds Episode 6
Mode's CEO talks the most common misconceptions about analytics, the skills that data people need to foster to become team leaders, and how the data team can make the case to invest in the tools they need.-Data Minds
How to stay out of analytic rabbit holes: avoiding investigation loops and their traps
“[O]ver-analysis begins when the data scientist starts focusing on the hypothesis instead of the decision.” Here's how to tell the two approaches apart and avoid analysis paralysis.-cyborgus
The Startup Founder’s Guide to Analytics
“How do I build a business that produces actionable data?” is much harder to answer than “What metrics should I track?” Here's a quick start implementation guide based on the growth stage of your company.-ThinkGrowth.org
3 Tips for Centralizing Your Analytics Team Structure As You Grow
In practice, centralized analytics teams often report into product while supporting the needs of the entire business.-Mode
Data Science On The Silicon Beach
In this interview, the Chief Data Officer for the city of San Diego discusses his team’s ad-hoc approach, integrating their stack with legacy systems, and his plans for employing data to alleviate traffic congestion.-Partially Derivative
Hiring a data scientist
Hiring for a data analyst is no easy task. Wikimedia shares how they drew on existing resources to synthesize a better approach to interviewing and hiring a new member of their data team.-Wikimedia
How Fitbit’s data science team scales machine learning
Workout regimens need to be tailored to each individual. Directional correctness isn’t enough. Fitbit’s head of data science shares how his team builds a model for every user to increase motivation and prevent injuries.-Mixpanel
Scaling Data Science at Stitchfix
Not many companies can say they employ 80 data scientists. The folks at Stitchfix share their tactics for making data and compute resources more accessible—which in turn keeps data scientists happy and infrastructure healthy.-MultiThreaded
Building & Maintaining a Master Data Dictionary: Part 2
Check out these ideas for structuring key metric definitions to keep everyone at your organization on the same page.-The Data Point
What’s it like to work in sports analytics?
From the outside, crunching numbers for a national sports league seems glamorous. The cold hard truth? It’s an often thankless job with low pay and long hours. The only thing that’ll prevent burnout is a pure love of the game.-StatsbyLopez
Quora Session with Monica Rogati
The Former VP of Data at Jawbone did a Quora session last week.-Quora
How To (Actually) Calculate CAC
Quick: What’s the difference between customer acquisition cost (CAC) and cost per acquisition (CPA)? If you hesitated, this post is for you.-Brian Balfour
Breaking the Vanity Metric Cycle
“[B]reaking free of worthless metrics is hard because it is breaking a psychological reward, not just adopting some new stats.”-Amplitude
The Limitations Of Data And Benchmarks
“All the quantitative analysis in the world won’t lead me to the next great idea for startup. Those figures can’t create empathy, develop the right culture, or hire the right people.”-Tomasz Tunguz
Trust in Data Science
“An untrusted analysis is an unused one, regardless of the quality. So how does one go about building, or rebuilding, trust in the face of challenges and failure?”-Clover Health
8 Data Science Skills That Every Employee Needs
A nice primer to share with your colleagues.-Amplitude
Practical advice for analysis of large, complex data sets
“This document has been read more than anything else I’ve done at Google over the last eleven years. Even four years after the last major update, I find that there are multiple Googlers with the document open any time I check.”-The Unofficial Google Data Science Blog
Ten Ways Your Data Project is Going to Fail
“Many companies seem to go through a pattern of hiring a data science team only for the entire team to quit or be fired around 12 months later. Why is the failure rate so high?”-Martin Goodson
Why I’m Teaching Twitch to Predict the Future
Forecasting is a good habit to adopt in the workplace. It’ll help you figure out the odds of delivering on your goals. Plus, having a record of accurate predictions builds trust in your work and analytical thinking in general.-Twitch
Tracking Customer Service Metrics With SQL
This guide includes a dozen SQL queries for calculating customer service metrics with raw Intercom data.-Mode
Data Literacy, Product Design and the Many-Faced God
“Building a team that’s doing ‘cutting-edge research in deep learning, machine intelligence, and artificial intelligence’ is not easy—not in this hiring environment. But infusing data thinking throughout a company is orders of magnitude harder. This matters, because data thinking permeates your products and can make them feel ‘smart’—or not.”-Monica Rogati
Don’t Become a Victim of One Key Metric
“[T]he search for one key metric at all for a complex ecosystem like Pinterest over-simplifies how the ecosystem works and prevents anyone from focusing on understanding the different elements of that ecosystem. You want the opposite to be true.”-Casey Winters
GGPlot2 As a Creativity Engine, and Other Ways R is Transforming the Financial Times Data Journalism
Learn how the Finanical Times produces high-quality data visualizations in this presentation, complete with the R code and data used for their piece, Explore the changing tides of European footballing power.-Financial Times
Surviving Data Science “at the Speed of Hype”
Complex optimization models work best when they’re asked to deal with stable business problems, like airline scheduling or ad targeting at Google. But at a startup, where the business model is constantly changing, simply summarizing data is a much better way to find answers.-John Foreman
How We Rebuilt the Wall Street Journal’s Graphics Team
The WSJ used to have two Graphics teams—one for print and one for the web. Combining the two has allowed editors to focus on storytelling from the start of projects, instead of the medium.-Source
What I Wish I Knew About Data For Startups
One entrepreneur reflects on his learnings from four years of working with data at a startup. It’s a goldmine of advice on building a strong, scaleable data culture. Don’t skip this one. Seriously.-Jean-Nicholas Hould
Simple requirements gathering questions for dashboard design
Next time someone asks you to make a dashboard, pull this list out. It provides a framework for sussing out what’s needed for the dashboard to be useful and effective.-Paint by Numbers
FiveThirtyEight’s data journalism workflow with R
FiveThirtyEight’s quantitative editor shares the analytical process behind some of their publication’s most popular articles.-useR!
The Data Driven Daily
This newsletter provides definitions of business KPIs, how to calculate them for your business. This week they’re covering how to determine the size of your potential customer market. The archive is well worth perusing; past segments include revenue calculation and pricing strategy.-Outlier
One year as a Data Scientist at Stack Overflow
The chronicle of one data scientist’s transition from academia to the tech industry, combined with a peek into Stack Overflow’s machine learning and data infrastructure projects.-David Robinson
Whom the Gods Would Destroy, They First Give Real-time Analytics
Every few months, I try to talk someone down from building a real-time product analytics system. When I'm lucky, I can get to them early.-Dan McKinley
Real-time dashboards considered harmful
There’s a certain allure to real-time data: your team can see what’s happening right now and take action immediately. Ultimately, though, most real-time dashboards create a bunch of noise that distracts you from more important metrics.-Basecamp
Boosting Sales With Machine Learning
One developer shares how his team used natural language processing and machine learning in Python to pre-qualify sales leads so reps don’t have to spend hours doing it manually.-Xeneta
Scaling Knowledge at Airbnb
Airbnb’s data team shares their solution to ensuring insights don’t get lost in Google docs or email threads: a centralized knowledge repository.-Airbnb Engineering
Bridging the Gap Between Data Science and Data Engineering
Josh Wills, Director of Data Engineering at Slack, shares his thoughts on how data engineers and data scientists work best together.-Hakka Labs
Statistically Interesting
Craving a new data science podcast? Check out Statistically Interesting, a series of interviews with data science leaders at companies like Twitch, Vimeo, and Weebly.-Statistically Interesting
How to Make Reps Care About Data Quality
When a sales rep fails to record information about her activities or clients, it can lead to incomplete and inaccurate reports and forecasts. These tips and tricks will help sales leaders encourage reps to be vigilant about consistently logging data.-InsightSquared
The View From The Data
Making data-informed decisions has a lot more to do with people than it does with the actual data.-Karen Roter Davis
Building Data Science in Healthcare
Many tech companies have complete control over the format of the data they collect. Healthcare, which relies on external data about patients and their interactions, has no such luxury. Ian Blumenfield, Head of Data Science at Clover Health, shares how they handle messy data and the other unique data challenges the industry faces.-Clover Health
This Is How You Build Products for the New Generation of ‘Data Natives’
We’ve grown used to the idea of digital natives—the toddler who expects everything to be a touchscreen and pinches and swipes her fingers on TVs and magazines. But data natives are something different: they expect “their world to not just be digital, but to be smart and to adjust immediately to their taste and habits.” Monica Rogatti, former VP of Data at Jawbone, shares ideas for harnessing data to build products for these new consumers.-First Round Review
So you want to build a data business? Play the long game
Foursquare has demonstrated, once again, that it’s capable of predicting public company earnings with an incredible degree of accuracy based on real world foot traffic data.-Michael Carney
Hot property: How Zillow became the real estate data hub
Zillow is a real estate powerhouse, and one of their biggest competitive advantages is their massive dataset of property listings. The most interesting part of this article goes into how their data science team brings together messy data from disparate sources to create one coherent super-dataset.-InfoWorld
The Art and Science of Storytelling Through Data at Jawbone
Analysis can be worthless if it’s not communicated well. Jawbone data scientist Kirstin Aschbacher shares how she develops a data story that inspires action, from concept to presentation.-Insight Data Science
How Does the Data Science Team Work at Twitch?
In this interview, Twitch's Director of Science shares how the data science team thinks about mentorship, gaining leverage, and qualitative research.-Mode
Analyzing Your Stripe Data, Part 1: Measuring Subscription MRR
Got raw Stripe data? Want to calculate your subscription monthly recurring revenue? Lucky for you, this post provides the SQL queries you’ll need, tips for data prep, and ways to tailor the analysis to your business.-Analyst Collective
Doing Data Science Right — Your Most Common Questions Answered
This is a must-read for startup founders who want to build data science teams. It’s packed with details on the inner-workings of data-driven businesses and advice on where to start based on your company’s needs.-First Round Review
Building a high-throughput data science machine
Scaling is a problem every data science team faces. How do you go from one nomadic analyst roaming between departments to a structured team? The answer is a little different for every company, but this interview introduces some best practices to keep in mind.-O’Reilly
How to Find Correlative Metrics For Conversion Optimization
A thorough walk-through of how to find correlative metrics and leverage them for conversion. It’s jam-packed with examples and advice from experts, plus a handy list of tools.-ConversionXL
Why Airbnb Has a Data Scientist on Every Leadership Team
Airbnb's head of data science shares his keys for success in data and business.-Inc.
Riley Newman on Data Science for Startups
In this interview, Airbnb’s head data scientist Riley Newman talks about building a strong data culture, balancing technical skills with storytelling ability, and scaling data science at a high-growth startup.-Intercom
How does Lumosity use data science?
An inside look at the structure of Lumosity’s data science team and the internal tools and product features they build.-Quora
CAC Payback Period: The Most Misunderstood SaaS Metric
If you’re calculating customer acquisition cost payback period for a SasS product, keep these two things is mind: payback metrics are about risk, not return, and that most SaaS products operate on an annual model, not monthly.-Kellblog
How to catch million dollar mistakes before they cost you millions of dollars
Are you measuring the impact of back-end updates on user behavior? Failure to do so could cost you big time.-Lucidchart
The Five-Step Guide to Robust Help Center Metrics
When a documentation manager set out to revamp her company’s help site content, she was surprised to find very few resources on how to measure her project. Thankfully, she documented her journey so we can all learn from it. Great tips in here for anyone looking to make their help center more, well… helpful.-RJMetrics
The Role of Statistical Significance in Growth Experiments
When you run an experiment, you’re looking for statistically significant results. But if you’re running growth experiments on a product—iterating quickly to optimize—the standard rules of statistical significance may not apply.-Medium
Minimum Viable Onboarding for PMs
A product manager from Doordash shares his thoughts on the most successful employee onboarding process he’s experienced. Spoiler—it involves data analysis.-Charlton Soesanto
7 Steps to Measuring the Success of a Feature
You’ve spent months working on a feature and now it’s live. How do you tell if users actually like it? Dig into your user data and start measuring with this detailed walkthrough.-Amplitude
Highly Effective Data Science Teams
To do great data science work, you need more that a huge heap of data. This article offers 14 criteria for assessing your team’s effectiveness.-Twitch
What BuzzFeed’s Dao Nguyen Knows About Data, Intuition, And The Future Of Media
This entire article is worth reading, but skip to the middle for the real gem—publisher Dao Nguyen’s holistic philosophy on data at BuzzFeed. “[Y]ou can’t only use comments, you can’t only use data, you can’t only use anything. You can’t only use your own intuition, either. It has to be all of those things you use.”-Fast Company
Diligence at Social Capital, Epilogue: Introducing the 8-ball and 'GAAP for Startups'
Figuring out what metrics to present to investors can be a struggle for startups. That’s because there’s really no standardized metrics or reporting in the startup world. Venture capital firm Social Capital is hoping to change that with their tool for gauging product-market fit at early stage companies. Plug in your own data and give it a whirl.-Jonathan Hsu
Building a business that combines human experts and data science
An insightful interview with Eric Colson about algorithms, human computation, and building data science teams at Stitch Fix and Netflix.-O’Reilly Data Show Podcast
You’re Measuring Daily Active Users Wrong
A high number of daily active users (DAU) may sound impressive, but does it actually mean anything? To make your DAU metric actionable, you need to measure how often users are getting core value out of your product, not how many times they log in.-Amplitude
The Ecommerce Holiday Customer Benchmark
Those new customers from the holiday season are more valuable than you thought. So when should you engage these shoppers to turn them into repeat customers?-RJMetrics
How Toyota Revamped Its Collections Biz with Big Data Analytics
Toyota Financial Services (TFS) used to collect car payments with a one-size-fits-all approach. Then the recession hit. For the first time, 100,000 people a day were behind on their payments. With a massive analytics overhaul, TFS were able to personalize their collection strategies and help 6,000 customers keep their cars from being repossessed.-Datanami
How Instacart Uses Redshift to Drive Growth
In this interview, Fareed Mosavat, growth PM at Instacart, shares how his team combines behavior, shipping, and fulfillment data to inform product decisions. Check out how his team uses SQL to define internal metrics, conduct A/B tests, and discover how many touches it takes before a user makes their first order.-Segment
Wrangle Conference 2015
If you didn’t get to attend the Wrangle Conference in October, now’s a good time to catch up.-Cloudera
Fashion Goes Deep. Data Science at Lyst
Fashion moves quickly. So, too, does the data science that powers e-commerce sites. In this interview, Lyst lead data scientist Eddie Bell shares the ins and outs of their recommendation engine. Learn how his team has tackled the challenges of constantly changing merchandise and kept suggestions fresh using machine learning and image analysis.-Fast Forward Labs
Data Down on the Farm
This episode is part of a series about data and food from Andreessen Horowitz. Learn how farmers are using software and analytics programs to monitor crop health and performance, implement agricultural policies, and adopt revenue-focused business practices.-Andreessen Horowitz