Rebuilt a returns dashboard nobody trusted — and found the numbers were double-counted
The sharer told this exact project in their Amazon interview loop and went on to receive a return offer.
Step into this interview
4 real follow-ups from the actual loop · 1 hard · ~12 min
You answer each question first — only then does the sharer's real take open up.
How they told it
An intern inherited a returns-rate dashboard that ops leadership had quietly stopped using. The interesting part is what they found once they actually traced the query.
Read the full telling
My intern project was to 'refresh' a returns dashboard for one category team. Pretty quickly a manager told me nobody looked at it because the return rate seemed too high. So instead of just restyling it, I traced the SQL behind the top tile. The return rate was computed as returns divided by orders in the same window, but returns lag orders by weeks, so December returns were being divided by January orders. On top of that, one join fanned out when an order had multiple line items, so multi-item orders counted their returns twice. I rewrote it to cohort returns back to the original order date and de-duplicated at the order level. The 'scary' 11% dropped to about 6.5%, which matched what the finance team saw in their own numbers. I built the new version in QuickSight, wrote a short doc on the cohorting logic, and added a data-quality check that flagged if returns ever exceeded orders for a cohort. My manager had the category team start using it again in their weekly review. It was not glamorous, but it taught me that a wrong number is worse than no number.
What they actually got asked
How did you know 6.5% was right and not just a different kind of wrong?
hardWhy cohort by order date instead of just using a rolling window?
mediumWhat would you do if the underlying order data itself were dirty?
mediumDid the dashboard being wrong cause any bad decisions before you fixed it?
easy