Comparison

Watson Glaser vs SHL: Law Firm Screen or Consulting Gate?

The Watson Glaser Critical Thinking Appraisal and SHL Verify G+ sometimes get lumped together in generic "reasoning test" advice, but they are fundamentally different tests measuring different things. Watson Glaser is a critical thinking test: argument evaluation, inference judgment, assumption recognition. SHL Verify G+ is a general cognitive ability test: numerical reasoning, inductive reasoning, deductive reasoning. If you prep one as if it were the other, you will underperform both.

By PrepClubs Editorial Team, updated April 18, 2026

See pricing

Side-by-side: Watson-Glaser vs SHL

The difference in what they measure is the whole story. Watson Glaser assesses how carefully you evaluate arguments. SHL assesses general reasoning across domains.

	Watson-Glaser	SHL
Full Name	Watson-Glaser Critical Thinking Appraisal	SHL General Ability Test (Verify G+)
Vendor	TalentLens (Pearson)	SHL
Questions	40	~30 (adaptive)
Time Limit	30 minutes	36 minutes
Seconds per Question	45 seconds	~72 seconds
Adaptive	No (linear)	Yes (item-level)
What It Measures	Critical thinking (RED model)	General cognitive ability
Sections	Inference, Assumptions, Deduction, Interpretation, Evaluation of Arguments	Numerical, Inductive, Deductive
Math Content	None	Heavy (numerical module)
Calculator	Not applicable	On-screen calculator
Headline Employers	Clifford Chance, Linklaters, Allen & Overy, Freshfields, Hogan Lovells	Deloitte, PwC, JPMorgan, Barclays, Citi, Unilever
Industry Lean	Law (Magic Circle, US BigLaw)	Consulting, banking, FMCG
Why It Exists	Predict legal reasoning	Predict graduate job performance

Format: two tests measuring different skills

Watson Glaser is a 40-question, 30-minute test organized into five explicit sections based on the RED model (Recognize assumptions, Evaluate arguments, Draw conclusions). The five sections are Inference (8 items), Recognition of Assumptions (8), Deduction (8), Interpretation (8), and Evaluation of Arguments (8). Each section has its own internal logic and answer format. You move through the test linearly.

SHL Verify G+ is a 36-minute adaptive session across three modules (numerical, inductive, deductive) with roughly 30 items total. SHL covers numerical reasoning (math, charts, percentages) heavily. Watson Glaser contains zero math. SHL covers inductive pattern discovery (visual patterns, series). Watson Glaser contains zero pattern recognition. The tests are categorically different in content.

The common thread is that both include deductive reasoning, but even here the implementations differ. Watson Glaser deduction uses formal syllogisms with "true, false, insufficient" answers. SHL deduction uses a mix of conditional reasoning and logical equivalence with a broader answer set. A candidate strong in Watson Glaser deduction is better prepared for SHL deduction than the reverse, because Watson Glaser forces a stricter evaluation habit.

Timing: 45 seconds versus 72 seconds

Watson Glaser gives you roughly 45 seconds per question over 30 minutes for 40 items. Most candidates find this adequate if they resist re-reading. The failure mode is second-guessing: candidates read a passage, form an answer, then re-read and change their answer under the pressure of time. Law firms report that score gains on Watson Glaser come almost entirely from building confidence in first-read answers.

SHL Verify G+ gives you 72 seconds per question over 36 minutes for roughly 30 items. The extra headroom is deceiving because SHL items (especially numerical with multi-step chart calculations) genuinely need the time. You do not have a per-question timer, but you cannot skip or go back in most SHL versions, so each question is a final commitment.

The psychological difference: Watson Glaser rewards calm evaluation under mild time pressure. SHL rewards decisive commitment under heavy content load. Candidates who panic on Watson Glaser usually second-guess themselves into wrong answers. Candidates who panic on SHL usually run out of time on the numerical module because they burned too long on one chart.

Section breakdown: five subskills versus three modules

Watson Glaser measures narrow critical thinking skills. SHL measures broad general ability.

Inference (Watson Glaser) vs Numerical (SHL)

Watson Glaser Inference: 8 items asking you to judge whether a conclusion is true, probably true, insufficient data, probably false, or false given a passage. SHL Numerical: 12 items asking you to extract numerical answers from charts, tables, and financial data. Zero content overlap.

Recognition of Assumptions

Watson Glaser-specific: 8 items asking whether a given assumption is made in a statement. SHL does not test this explicitly. Assumption recognition is a legal reasoning skill that matters in argumentation; it does not matter much in consulting or banking cognitive screens.

Deduction

Both tests include deduction. Watson Glaser Deduction: 8 items using formal syllogisms, evaluated with "conclusion follows / does not follow" format. SHL Deduction: 8 items using syllogisms, conditional reasoning, and logical equivalence. Watson Glaser is stricter in format, SHL is broader in content.

Interpretation and Evaluation (Watson Glaser) vs Inductive (SHL)

Watson Glaser Interpretation: 8 items asking if a conclusion can be inferred from given facts. Watson Glaser Evaluation of Arguments: 8 items asking if an argument is strong or weak. SHL Inductive: 10 items on pattern series and shape relationships. Different skills, different evaluation habits.

Which is actually harder

Difficulty depends on your cognitive profile. Candidates who think in natural language and enjoy dissecting arguments find Watson Glaser easier. Law students and philosophy graduates typically score above the 70th percentile on first attempt. Candidates who think in numbers and patterns find SHL easier. Engineering and STEM graduates typically score above the 75th percentile on first SHL attempt.

The Watson Glaser ceiling is high. Magic Circle law firms (Clifford Chance, Linklaters, Allen & Overy, Freshfields) typically require the 70th to 80th percentile against the legal professional norm group, which is a harder norm group than the graduate norm group most SHL cutoffs use. A Watson Glaser 70th percentile against legal norms roughly equals an 85th percentile against general graduate norms.

The SHL ceiling is also high but accessible via prep. Strong candidates who practice adaptive pacing and calculator fluency can move their SHL composite by 10 to 15 percentile points in 2 weeks of targeted prep. Watson Glaser moves less with prep because argument evaluation habits are harder to shift short-term, but they do shift 5 to 8 percentile points with concentrated practice.

How scores are interpreted

Watson Glaser reports a raw score out of 40 plus a percentile against a selected norm group (graduate, legal professional, managerial). Magic Circle law firms use the legal professional norm group, which compresses the top end: scoring above the 75th percentile against legal norms is quite difficult even for strong candidates. US law firms often use graduate norms which are more forgiving.

SHL Verify G+ reports a percentile against a chosen norm group per module and composite. Big 4 consulting typically requires 80th percentile composite. Investment banking often requires 85 to 90 on numerical specifically. Adaptive scoring means percentile is the only meaningful metric; raw counts are not reported as useful.

A combined implication: candidates applying to both law and consulting (not common but happens for some strategy-and-legal dual-track graduate schemes) cannot just take one test and have it count for the other. Both require separate administration, and the skills are measured differently enough that strong performance on one does not guarantee strong performance on the other.

Where each test lives

Watson-Glaser

Watson Glaser is the gold standard for law firm hiring. UK Magic Circle (Clifford Chance, Linklaters, Allen & Overy, Freshfields, Hogan Lovells) uses it as a mandatory early-stage screen for training contracts and vacation schemes. US BigLaw increasingly uses it for associate hiring. It also appears in legal-adjacent roles (corporate counsel, compliance, legal operations) and occasionally in consulting firms hiring for legal strategy teams.

Clifford ChanceLinklatersAllen & OveryFreshfieldsHogan Lovells

SHL

SHL Verify G+ is the most widely used cognitive test across UK and European graduate hiring in consulting and banking. Big 4 consulting (Deloitte, PwC, KPMG, EY) uses it universally. Investment banking graduate schemes (JPMorgan, Barclays, Citi, Goldman UK) use it. Global FMCG (Unilever, Nestle, Diageo) uses it. Roughly 50 percent of the FTSE 100 uses SHL products. US tech companies use it less frequently than CCAT.

DeloittePwCJPMorganBarclaysCitiUnilever

How prep differs fundamentally

Watson Glaser prep is about training argument evaluation habits. The single highest-leverage drill is the "inference grid" exercise: for each inference item, explicitly write down why each of the five answers (true, probably true, insufficient, probably false, false) could or could not apply given the passage. Doing this on 40 to 60 practice inference items over a week builds a reflexive habit of separating "evidence supports" from "evidence does not contradict." Most candidates lose points by confusing these two.

Watson Glaser deduction prep: master the distinction between "conclusion follows logically" and "conclusion seems intuitively right." Formal syllogisms reward strict logical evaluation. Assumption recognition prep: practice the distinction between "assumption is explicitly stated" (which is never the answer) and "assumption is implicitly required for the argument to hold" (which is usually the answer).

SHL Verify G+ prep is about adaptive pacing and calculator fluency for numerical, plus pattern recognition speed for inductive. 14-day plan: week 1 on numerical (60-to-75-second-per-chart pacing, on-screen calculator shortcuts, business-context vocabulary). Week 2 on inductive (pattern series, shape rule discovery) and deductive (syllogisms, conditional reasoning). Final 2 days on full 36-minute adaptive simulations.

If you somehow face both (rare but possible for dual-track graduate applicants), prep them in entirely separate windows. Do not interleave. The mental postures are different enough that interleaved prep dilutes both. Block out Watson Glaser for week 1, SHL for week 2, and run two separate timed mocks in the final 48 hours.

See pricing

Which one you should actually prep for

If you are a law candidate or applying to a Magic Circle or US BigLaw training contract or associate role: Watson Glaser. Prep with the RED model and drill inference-grid exercises daily for 2 weeks before the test.

If you are a consulting, banking, or FMCG graduate candidate: SHL Verify G+. Prep 14 days with heavy focus on numerical charts and adaptive pacing.

If your career path is ambiguous and you might face either: check your specific employer invitations before prep. Watson Glaser prep does almost nothing for SHL success, and SHL prep does almost nothing for Watson Glaser success. These are not substitutes.

Go deeper

Watson-Glaser Critical Thinking Appraisal

The Watson-Glaser is the gold standard for law firm hiring. It measures inference, assumptions, deduction, interpretation, and argument evaluation. Magic Circle firms lean on it heavily.

Go deeper