The single most important thing a small employer can know about hiring is which selection methods actually forecast who will do the job well — and that answer changed in 2022. For roughly two decades the field’s reference point was Schmidt & Hunter (1998), which ranked general mental ability (GMA, i.e. cognitive ability) at the top with an operational validity around .51 and work samples at .54. Sackett, Zhang, Berry & Lievens (2022) showed that a statistical correction for “range restriction” had been applied too aggressively in the meta-analyses those numbers came from, inflating many validities. When they reanalysed the evidence using more defensible corrections, the magnitudes fell and the ranking reshuffled.

The revised operational validities (Sackett et al., 2022). Higher numbers mean better prediction of job performance; the maximum possible is 1.0.

  • Structured employment interviews — .42 (down from .51)
  • Job-knowledge tests — .40 (down from .48)
  • Empirically-keyed biodata — .38 (up slightly from .35)
  • Work-sample tests — .33 (down sharply from .54)
  • General mental ability / cognitive ability — .31 (down from .51)
  • Integrity tests — .31 (down from .41)
  • Assessment centres — .29 (down from .37)
  • Interests, defined as person-job fit — .24 (up from .10)
  • Conscientiousness, overall — about .19-.21 (down from .31)
  • Unstructured interviews — .19 (down from .38)
  • Years of job experience — .07 (down from .18)

The headline. Cognitive ability is no longer the stand-out single predictor. Structured interviews emerged with the highest mean validity, and job-knowledge tests, empirically-keyed biodata, and work samples sit at or above GMA. Sackett et al. (2023) put it bluntly: one could now treat structured interviews, not cognitive ability, as the focal predictor against which others are compared.

The trap to avoid. A great deal of HR content online — vendor blogs, “best predictor” listicles, even some training decks — still repeats the 1998 figures (GMA on top at r ≈ .51, work samples at .54). Treat those numbers as outdated. The reason is specific and technical: the older meta-analyses corrected for range restriction using artifact distributions that assumed far more restriction than the data actually supported, which systematically overstated validity. If you see “cognitive ability is the #1 predictor at .51,” you are reading pre-2022 numbers.

What did not change. The revision lowered magnitudes; it did not overturn the core practical lesson. The strongest predictors are still job-specific methods built on a real understanding of the role (structured interviews, job-knowledge and work-sample tests, biodata), and combining methods still beats any single one. “Lower than we thought” is not “useless.”

A note on certainty. The .42 for structured interviews carries a wide spread (standard deviation .19; an 80% credibility interval roughly .18-.66), meaning a structured interview can be designed well or badly. The revised numbers are also contested in follow-up literature, so treat any single decimal as an estimate rather than a fixed fact.

For an Ontario SMB, the takeaway is that the cheapest high-validity upgrade available is not a fancier test — it is converting your existing interview into a structured one and adding a second job-relevant method. The dollar case for getting this right lives in the numbers cluster (the cost of a bad hire); the legal limits on what you may test live in the Compliance cluster.