12-week study plan

Calibrated to "rusty Staff RS rebuilding for Sr Staff at frontier labs." Adjust as you go — the plan is a tool, not a contract.

Phase 1 · Bootstrap (weeks 1–2)

Goal: shake off rust without breaking confidence. Diagnose what's actually weak vs what just feels weak.

Week 1 — Diagnostic

Week 2 — Tooling + foundations

Phase 2 · Depth (weeks 3–8)

Goal: repair weak areas; sharpen strong ones into Sr Staff narratives.

WeekMonTueWedThuFriSat/Sun
3 Transformers + concurrency Re-derive attention; LeetCode 2× medium RoPE + KV cache notes; concurrency: Python GIL deep-dive FlashAttention paper; LeetCode 2× medium Implement multi-head + RoPE in PyTorch from scratch Mock interview (LC hard); DDIA Ch 2 (Data Models) Write up notes; 1 ML system design
4 Distributed training ZeRO 1/2/3 paper; LC 2× medium FSDP code-walk; concurrency: asyncio internals Megatron tensor parallel; LC 1× hard Pipeline parallel (1F1B, interleaved); DDIA Ch 3 (Storage) Implement a 2-GPU DDP toy in PyTorch Mock ML design (Design pretraining infra)
5 RLHF + reasoning InstructGPT; LC 2× medium DPO paper; concurrency: lock-free data structures GRPO + DeepSeek R1; LC 1× hard Constitutional AI + RLAIF; DDIA Ch 5 (Replication) Implement DPO loss in PyTorch 1 ML system design (Design RLHF pipeline)
6 Inference + serving vLLM blog + paged attention; LC 2× medium Speculative decoding; concurrency: producer-consumer in C++ Quantization (GPTQ, AWQ, FP8); LC 1× hard Disaggregated prefill/decode; DDIA Ch 6 (Partitioning) Build a toy KV cache + simple speculative decoder 1 ML system design (Design ChatGPT serving)
7 RecSys + ranking DLRM paper review; LC 2× medium Two-tower + ANN (HNSW); concurrency: distributed locks Sequence models (SASRec, BERT4Rec); LC 1× hard Generative recsys (TIGER); DDIA Ch 7 (Transactions) Implement two-tower in PyTorch with in-batch negatives 1 ML system design (Design Pinterest home feed)
8 System design + behavioral 3 worked classic system designs; LC 2× medium Distributed rate limiter, distributed counter; concurrency: read-write locks RAG at scale; LC 1× hard Vector DB internals (HNSW, IVF-PQ); DDIA Ch 8–9 (Trouble + Consistency) Write 5 STAR stories (impact, conflict, failure, leadership, ambiguity) Mock behavioral with friend; refine stories

Phase 3 · Apply + iterate (weeks 9–12)

Goal: open the funnel strategically. Tier-B first to recalibrate; Tier-A after 1–2 onsites; top tier last.

Week 9 — Funnel open

Week 10 — First loops

Week 11 — Top tier

Week 12 — Negotiate

Estimated time budget

ActivityHrs/wk wks 1-2Hrs/wk wks 3-8Hrs/wk wks 9-12
LeetCode / coding drills564
ML coding from scratch231
Reading (papers, DDIA)452
System / ML design practice243
Mock interviews123
Apply / outreach / recruiter calls128
Total~15~22~21
Reality check
You have a full-time Staff job and Selma. 22 hours/week is not realistic alongside both. Either drop Selma to ~5h/wk for 2 months, or extend the plan to 18 weeks. The plan above assumes the 12-week version.

What to skip