Question 1

What's the actual trick to this problem?

Accepted Answer

Use two heaps: a max-heap for the smaller half of the numbers and a min-heap for the larger half. Keep them balanced, with sizes differing by at most one, so the median always sits at the top of one heap or is the average of both tops and can be read in O(1) time. The hard part is managing insertion and rebalancing in O(log n). Most failures happen because candidates try a sorted array and get TLE (time limit exceeded).
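A minimal sketch of the two-heap idea, assuming Python's standard heapq module. The class and method names (MedianFinder, add_num, find_median) are illustrative, not taken from any official problem signature. heapq only provides a min-heap, so the max-heap is simulated here by storing negated values.

```python
import heapq


class MedianFinder:
    """Two-heap sketch; names are illustrative assumptions."""

    def __init__(self):
        self.low = []   # max-heap of the smaller half, stored as negated values
        self.high = []  # min-heap of the larger half

    def add_num(self, num):
        # Route the new value through low, then move low's largest into high.
        # This keeps every value in low <= every value in high.
        moved = -heapq.heappushpop(self.low, -num)
        heapq.heappush(self.high, moved)
        # Rebalance so low holds the same number of items as high, or one more.
        if len(self.high) > len(self.low):
            heapq.heappush(self.low, -heapq.heappop(self.high))

    def find_median(self):
        if not self.low:
            raise ValueError("no numbers added yet")
        if len(self.low) > len(self.high):
            # Odd count: the median is the top of the max-heap.
            return float(-self.low[0])
        # Even count: average the two heap tops.
        return (-self.low[0] + self.high[0]) / 2
```

Each add_num call does a constant number of heap pushes and pops, so it costs O(log n), and find_median only looks at the heap tops.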

Question 2

Is this really asked at Okta, Oracle, and Pinterest?

Accepted Answer

Yes, the data shows it's reported by 31 companies including Okta, Oracle, Pinterest, Splunk, and Tinder. It's a classic design problem that interviewers love. If your target company is on that list, this one matters.

Question 3

How does this relate to Heap (Priority Queue)?

Accepted Answer

Heaps are the core. The problem requires you to efficiently track the median as data streams in. Re-sorting on every query, or inserting into a plain list, is too slow. Two heaps give you O(log n) add and O(1) median retrieval. This tests whether you really understand heap semantics and can build a custom data structure.
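The heap semantics that matter here can be shown in a few lines. This is a small illustration assuming Python's heapq, which is a min-heap only; negating values on the way in and out is one common way to get max-heap behavior. The variable names are made up for the example.

```python
import heapq

values = [5, 1, 9, 3]

min_heap = []
max_heap = []  # holds negated values so the largest original value is at the root
for v in values:
    heapq.heappush(min_heap, v)    # O(log n) per push
    heapq.heappush(max_heap, -v)   # O(log n) per push

smallest = min_heap[0]   # O(1) peek at the root of the min-heap
largest = -max_heap[0]   # O(1) peek; negate again to recover the original value
```

Only the root is guaranteed to be in position; the rest of the list is not sorted, which is why a heap gives cheap access to one extreme and nothing else.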

Question 4

Can I solve this without heaps?

Accepted Answer

Technically yes, but usually with a worse trade-off. A sorted list works for small inputs but fails at scale because each insertion is O(n). A balanced binary search tree that tracks subtree sizes matches the O(log n) insertion and is conceptually clean, but it is much harder to code under pressure. Heaps are the accepted pattern here and what the interviewer expects.
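For comparison, here is a sketch of the sorted-list alternative, assuming Python's bisect module. The class name SortedListMedian is hypothetical. bisect.insort finds the position in O(log n) but still shifts elements, so each insertion is O(n) overall.

```python
import bisect


class SortedListMedian:
    """Sorted-list sketch; fine for small inputs, O(n) per insertion."""

    def __init__(self):
        self.items = []  # kept in ascending order at all times

    def add_num(self, num):
        # Binary search for the slot, then an O(n) shift to make room.
        bisect.insort(self.items, num)

    def find_median(self):
        n = len(self.items)
        if n == 0:
            raise ValueError("no numbers added yet")
        mid = n // 2
        if n % 2 == 1:
            return float(self.items[mid])
        return (self.items[mid - 1] + self.items[mid]) / 2
```

The median lookup is O(1) here as well; the cost that hurts at scale is the insertion.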

Question 5

What do I do if I don't see the two-heap pattern during the real OA?

Accepted Answer

Write a brute-force solution first: sort and return the middle. Get it working, explain the O(n log n) cost per call, then optimize if time permits. If you freeze completely, StealthCoder runs invisibly and surfaces the heap solution so you can move on instead of failing the OA.
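The brute-force fallback described above might look like the following sketch. The class name BruteForceMedian is an assumption for illustration. It appends in O(1) and sorts a copy on every median query, which is the O(n log n) per-call cost worth stating out loud.

```python
class BruteForceMedian:
    """Brute-force sketch: store everything, sort on every query."""

    def __init__(self):
        self.items = []

    def add_num(self, num):
        self.items.append(num)  # O(1) amortized

    def find_median(self):
        if not self.items:
            raise ValueError("no numbers added yet")
        ordered = sorted(self.items)  # O(n log n) on every call
        n = len(ordered)
        mid = n // 2
        if n % 2 == 1:
            return float(ordered[mid])
        # Even count: average the two middle elements.
        return (ordered[mid - 1] + ordered[mid]) / 2
```

Keeping the same two methods as the optimized version means the heap-based approach can replace it later without changing the calling code.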

Find Median from Data Stream
