Companies — tier list & profiles

Filtered for your stack: Bay Area only · ML-focused · top tier · Sr Staff/Staff IC · no finance. Ranked specifically for a RecSys + pretraining background.

Filter rules applied

The tier list (S → C)

S
Anthropic OpenAI Thinking Machines Safe Superintelligence Meta MSL (internal only)
A
Google DeepMind xAI Cursor Physical Intelligence World Labs Mistral (PA hub) Periodic Labs Magic.dev Mirendil
B
Pinterest Roblox Reddit Uber Airbnb DoorDash Instacart Lyft AppLovin LinkedIn Databricks Nvidia Apple ML Sierra Cohere Perplexity Decagon Cognition Glean Luma Mind Robotics Rhoda AI Black Forest Labs (SF) Together AI Fireworks AI Baseten Cerebras Groq Modular MatX
C
Adobe Firefly Salesforce AI Stripe ML Notion AI Figma AI Snowflake ML Palantir AIP SandboxAQ Hippocratic Abridge Harvey Snorkel Sesame Ndea Harmonic Math Standard Kernel Andromeda Cluster Prime Intellect Inferact Aurora Vercel Coinbase (watch) Plaid (watch) Robinhood (watch)

Tier S = top priority, max comp + frontier impact. Tier A = strong fit + top comp. Tier B = solid Bay Area, RecSys-direct fallback. Tier C = watch / domain mismatch / lower comp.

Tier S — Frontier labs (apply first, with referrals)

Anthropic S

SF · ~3,500 ppl · $380B Series G (Feb 2026) · Blind 4.8

Comp: Staff $920k · T6 Tech Lead RS $2.0–2.5m · monthly vest after 1y cliff. Coatue 2030 estimate $2T.

Roles to target: MTS Pretraining, MTS Post-Training (RLHF/RLAIF), MTS Inference Performance, Product Eng (Claude apps).

Loop: ~20 days. CodeSignal take-home (4-level progressive spec, 90 min). Then onsite: ML round (Colab Q6), concurrency-flavored coding, ML config system design, behavioral. Reference checks during the loop.

Why for you: pretraining + post-training are the strongest teams. RecSys experience translates to personalization / product surface (Memory, Projects).

Levers: Mike Krieger (CPO) sources senior product IC directly via LinkedIn. Best: warm intro from MTS-level (Tom Brown / Sam McCandlish / Jared Kaplan area).

See: Anthropic prep deep dive

OpenAI S

SF Mission Bay · target ~8,000 by EOY 2026 · $852B Mar 2026 ($122B raised — world record) · Blind 4.4

Comp: L5 Staff ~$875k–1.5m · L6 Sr Staff $1.2–2.0m+. RSU (switched from PPU for new hires Jan 2026), immediate quarterly vest. Highest comp band on this list.

Roles to target: MTS Pretraining, MTS Post-Training, MTS Personalization/Memory, MTS Search/Embeddings, Applied AI Eng.

Loop: 2–8 weeks. CoderPad multi-part problems (60–75 min each). 4-6 hr final loop in 1–2 days. Coding bar is binary — they reject for 2/4 even if ML/research/behavioral nail it.

Why for you: Personalization / Memory team is your direct pivot — RecSys at GPT scale. Pretraining team is the alternative.

Levers: many ex-Meta in Pretraining + Multimodal — easy network bridge. Look for ex-Meta GenAI / RecSys colleagues now there.

See: OpenAI prep deep dive

Thinking Machines S

SF + PA · ~80–100 ppl · $50–60B Series A Q1 2026 (world-record Series A; $2B raised in seed) · no Blind data

Comp: $350k–475k base (filings); SWE/RE/RS tracks across Data Infra, Inference, Kernels, RL Systems, Tinker, Pretraining Datasets/Science, Post-Training, Synthetic Data, Vision, Audio. Equity at $50B → $200B+ trajectory.

Roles to target: MTS Pretraining, MTS RL Systems, MTS Inference, MTS Post-Training. Also product eng for Tinker.

Loop: Glassdoor n=14. Reportedly research-style + RE coding, OpenAI/Anthropic-tier bar. Multi-round, ~19 days.

Why for you: Andrew Tulloch is ex-Meta and reportedly turned down $1.5B from Meta to be there. They want infra+modeling depth. Tiny team = high impact.

Levers: ex-Meta GenAI alumni network. Read Mira / John / Lilian's public writing before the loop.

See: TML prep deep dive

Safe Superintelligence (SSI) S

Palo Alto · $32B (Apr 2025) · opaque

Comp: not public. Likely >$700k base just to attract from FAANG.

Roles: all "Technical Staff" — no public job board. Apply via website, slow.

Loop: famously secretive. Vetting candidates for "good character" over hours.

Why for you: Ilya-led pretraining + safety. Long shot but huge upside.

Levers: Daniel Gross (Andromeda) or Daniel Levy.

Meta MSL Sinternal

Menlo Park · ~$25m+ MTS RS (annual!)

If you are eligible to apply internally to Meta Superintelligence Labs (the TBD Lab Zuck/Wang are recruiting for), this is the highest-comp option on Earth right now.

Catch: external hires only via Zuck/Wang direct. Internal transfers possible but very competitive — must be top-of-band performer.

Action: ping your skip / VP about MSL fit before you announce externally. Once you're seen as a flight risk, it's harder to negotiate the internal move.

Tier A — Strong Bay Area AI labs (apply weeks 2–3)

Google DeepMind A

Mountain View · Blind 3.7

Comp: L6 Staff $700k+ · L7 Sr Staff $950k–1.1m. With DeepMind premium and Gemini budget, top of band $700k–$1m for verified pretraining experts.

Roles: Sr Staff Research Engineer (Gemini pre/post-training), Staff SWE on Gemini, Staff MLE on YouTube RecSys.

Loop: 41-day average. Paper deep-dive (60 min) + research framing (60 min) + ML coding (60 min, no AI tools) + standard Google loop. PhD-defense-grade depth.

Why for you: Gemini pretraining + YouTube RecSys are direct adjacencies. Lots of ex-Meta in YouTube ML.

xAI A

Palo Alto + SF + Memphis · ~1,500 ppl · combined $1.25T merger w/ SpaceX (Feb 2 2026) targeting $2T+ IPO mid-2026 · Blind 3.8

Comp: ~$1.2m Staff MTS · immediate monthly vest. SpaceX merger means equity is closer-to-liquid than other private labs.

Roles: SWE (frontier), ML Research Engineer, RecSys (X integration).

Loop: 2–4 weeks (faster than peers). 20-min "hardest tech problem" presentation + Q&A defense. CodeSignal OA (3 problems, 60 min). Mix of algos and thread-safety/concurrency. "Simple and correct beats clever and broken."

Why for you: Colossus pretraining at 200K+ H100s. Recsys for X feed.

Watch: Musk culture + intense PA in-person. Mixed Blind reports. SpaceX merger means equity is closer-to-liquid (IPO mid-2026 per tier list).

Cursor A

SF North Beach · ~50–80 ppl · $60B Series E Q1 2026 · $2B ARR

Comp: median $1.1m · range $850k–1.28m+. Highest median software comp on Levels.fyi. Tracks: Client Infra, Data Infra, Enterprise, Growth, Infrastructure, ML Infrastructure, Product Eng, RL Scaling, Research Scientist, Security.

Roles: MTS Models, MTS Infra, MTS Tab. Tiny team, extreme bar, ~1/50 candidates pass.

Loop: TypeScript-first (Python ok for ML roles). AI tools permitted but interviewer judges your prompts. Sr/Staff: 4–8 hr take-home in their actual codebase. Streaming Markdown parser, streaming edit application, multi-file diff coordination.

Why for you: Tab models are pretrained code models — pretraining experience direct fit.

Culture: flat, no PMs, in-person SF, 50+ hr weeks. Long-shot but stratospheric upside.

Physical Intelligence (π) A

SF · ~150 ppl · $11B (Mar 2026)

Robotics foundation models (π0 series). Pretraining for embodied agents. Modeling-heavy.

Comp: likely $500k–900k. Roles: ML Research Engineer, Foundation Model Eng. Hot space, major upside.

World Labs A

SF · $5B (Feb 2026)

Fei-Fei Li-led. Spatial AI / world models. Pretraining + multimodal modeling.

Roles: Computer Vision, Graphics, Robotics-intersection ML.

Mistral (Palo Alto US hub) A

$14B (Sep 2025)

Pretraining lab. PA US hub focused on Applied AI + Research Engineer hiring. Smaller US team (~50), less RTO-strict.

Comp: US PA hires reportedly $400k–700k total. Equity upside lower (European centroid).

Loop: ~7 stages — LLM theory → coding → past project → tech manager → ML system design → take-home → values.

Periodic Labs A

SF · $7B (Mar 2026)

AI scientist for materials science. Foundation models on scientific data — pretraining-adjacent. Founders ex-OpenAI/DeepMind, backed by Jeff Dean / Eric Schmidt / Bezos.

Reflection A

$27.5B Series C Mar 2026 — Brooklyn HQ

Frontier model lab. Pretraining + autonomous coding agents. Aims for Western open-source frontier model competing with advanced China systems. Brooklyn HQ — fails your geo filter. Watch only unless they open a Bay Area office.

Magic.dev A

SF · $1.5B+ (Aug 2024)

Code model frontier lab with massive in-house compute. Hiring MTS Kernels, Supercomputing Platform. Comp likely $600k–900k+. Pretraining fit if confirmed by recruiter.

Mirendil A

SF · ~$1B (Mar 2026 target)

Ex-Anthropic researchers (Behnam Neyshabur — pretraining theory). Bio + materials science. Strong pretraining DNA, niche domain. Watch for public Staff postings.

Tier B — Strong Bay Area, RecSys-direct fallback

CompanyHQStaff TCWhy for youLoop notes
PinterestSF$570k–959k (IC17)Pure RecSys at scale; ex-Meta network strong3 ML fundamentals + 2 LC hard tech screen; ML theory + LC mediums onsite. Difficulty 3.4/5
RobloxSan Mateo$750kPure RecSys, gaming contextHackerRank OA + Roblox Assessment + 2 onsite (med + hard) + behavioral
RedditSF$610kFeed RecSys + ads. Fully remote OK.Tech screen (build a model from data) → onsite (model design, feature eng, ad applied ML)
UberSF$620k–723k medianRecSys + ranking, scale5–7 rounds over 4–8 weeks; 60/40 ML/coding for MLE; recent DP-on-trees screening reports 2025
AirbnbSF$548k median, up to $1.64mTrust signals, dynamic pricing, personalizationLC med-hard, Airbnb-tagged. DP problems on trees common.
DoorDashSF$550kGeospatial, time-varying ranking, supply-aware60-min DSA round; LC medium with optimize follow-up. Design HashMap, Jump Game
InstacartSF$545kSame flavor as DoorDashBar Raiser; reference checks
LyftSF$475kLower ceiling than Uber but RecSys/eta similarStandard FAANG-style
AppLovinPalo Alto$600kHeavy ML/recsys for ad targeting; pure RecSys fit5 rounds: 3 tech + 1 sys design + 1 behavioral. Classic LRU O(1) → real-system extension. ~3 weeks total.
LinkedInMountain View$402k–700kFeed/jobs RecSys at scale; MS RSUStandard FAANG loop
DatabricksSF$504k–800k–1.65m (L7)Mosaic ML pretraining infra1hr CoderPad screen, LC med-hard, ~20% pass. Reference checks (3).
NvidiaSanta Clara$450k–675kRecSys, robotics ML, inference. Comp lower than frontier.ML system design + project depth. Stars pyramid, ring buffer, parallel coding.
Apple ML / GenAICupertino$550k–865kRebuild mode post-Ruoming-Pang exit; senior pretraining hires getting $1m+5–8 rounds × 45–60 min. 1–2 LC med-hard (rotate matrix, merge K, LRU). Some ML+DSA blends.
SierraSF$15.8B startupCustomer service AI agents. Bret Taylor co-founder.Replaced LeetCode with AI-native onsite: planning + 2-hr building phase using AI tools.
CohereSF/Toronto~$700kPretraining + enterprise post-trainingRecruiter → HM (projects) → onsite: coding + ML + sys design + behavioral. Forward-deployed-platform-engineer flavor.
PerplexitySF$450k median, up to $790kSearch + LLM. RecSys partial fit.Fast ~23-day loop. Python only. Streaming dedup, top-k, beam search, batch inference optimization.
DecagonSF$4.5B startupVoice/customer agentsCoding pair + sys design + past project + behavioral. Tic-Tac-Toe, conversation rolling-window scores, LC #84.
Cognition (Devin)SF$10.2B startupCoding agentsCustomer-facing roleplay + pair programming + architecture. "Customer is angry" simulations.
GleanPalo Alto$7.25BEnterprise search + agentsMLE for AI Assistant + Agents

Tier B continued — Inference / hardware infra (good fit if you want infra pivot)

CompanyHQNotes
Together AISF$3.3B. Open-source inference + post-training infra.
Fireworks AIRedwood City$4B. LLM inference.
BasetenSF$5B. Inference platform.
CerebrasSunnyvale$8.1B. Wafer-scale chips. ~26-day loop, LC med + parallel programming + matmul.
GroqMountain View$6.9B. LPU inference.
ModularLos Altos$1.6B. Mojo + AI inference engine. Chris Lattner.
MatXMountain ViewASICs for LLMs. Ex-Google TPU team.

Strategy: which tier to attack when

  1. Week 1–2 (warmup): apply to Tier B first (Pinterest, Roblox, Databricks, Cohere). Use these onsites as paid practice. Expect to bomb at least one.
  2. Week 3–4: apply to Tier A (DeepMind, xAI, Cursor, PI, World Labs, Mistral, Periodic, Magic).
  3. Week 5–6: apply to Tier S (Anthropic, OpenAI, TML, SSI). By now you'll be sharp from earlier loops.
  4. Always: keep at least 2 active Tier B in your funnel as offer leverage.
Referral > cold app
Frontier labs (S/A) reject ~95% of cold apps at résumé screen. Get a referral. From the company list and your Meta network, you almost certainly have ex-coworkers at Anthropic, OpenAI, DeepMind, Anduril, Pinterest, Databricks. Use LinkedIn alumni filter on "Meta" → current company → message.

Companies dropped (and why)

CompanyReason
Skild AIPittsburgh HQ, no Bay Area
RunwayNY HQ
ModalNY HQ
General IntuitionNY HQ
Liquid AICambridge MA
Reflection (Brooklyn $130m round)Brooklyn (different from $2B SF lab)
AndurilCosta Mesa primary; small Bay Area presence — Watch only
SnapSanta Monica (LA) — bordeline; drop on geo
CrusoeDenver HQ (SF is sales hub) — Watch
Surge AIdata labeling, no Staff modeling roles — Watch C
Aetherfluxspace-power company, not ML
All trading firmsfinance filter
Disneyper your call

Source: aggregated Apr 2026 tier-list data + Trueup AI lists + Levels.fyi May 2026 + recent funding announcements. Verify all comp at offer time on Levels.fyi.