Who takes the Stanford-Binet? From diagnosing intellectual disabilities to identifying giftedness (130+), we break down Full Scale IQ and verbal/nonverbal scores.
Dr. Russell T. WarneChief Scientist
Share
The Stanford-Binet Intelligence Scales is one of the oldest and most respected IQ tests in existence, with a history stretching back over a century. Currently in its fifth edition (SB5), the test measures cognitive abilities across a wide age range, from 2 years old to 85 and beyond, making it one of the most versatile individually administered intelligence assessments available.Â
What Is the History of the Stanford-Binet?
The Stanford-Binet traces its origins to the first successful intelligence test, created by French psychologist Alfred Binet and his colleague Theodore Simon in 1905. Their Binet-Simon Scale used a series of tasks arranged by difficulty level to assess cognitive development in children and identify children who were likely to struggle in school.
The test quickly gained international attention and was translated into multiple languages. Stanford University psychologist Lewis Terman recognized its potential and undertook a major revision for American use, expanding the test's scope and standardizing it on American children. Terman published this revision in 1916 as the Stanford-Binet Intelligence Scale, and the name has remained ever since.
Terman's version used the intelligence quotient (âIQâ) calculated by dividing mental age by chronological age and multiplying by 100. A child performing at exactly the expected level for their age received an IQ of 100, while those performing above or below received correspondingly higher or lower scores. This scoring method, though later replaced by deviation IQ scoring, established the concept and terminology of IQ, which persist today. Â
Subsequent revisions appeared in 1937, 1960, 1986, and 2003. Each revision updated norms, refined the theoretical foundation, and improved psychometric properties. The current fifth edition represents a substantial modernization, aligning the test with contemporaryCattell-Horn-Carroll (CHC) theory while maintaining continuity with the test's distinguished history.
What Does the Stanford-Binet Measure?
The fifth edition of the Stanford-Binet measures five cognitive factors, each assessed through both verbal and nonverbal tasks. This structure produces ten subtests total (five verbal and five nonverbal) plus composite scores at multiple levels.
⢠Fluid Reasoning assesses the ability to solve novel problems without relying on prior knowledge. Verbal fluid reasoning tasks might involve explaining how to solve everyday problems or identifying conceptual relationships, while nonverbal tasks include matrix reasoning items where examinees identify patterns in visual sequences.
⢠Knowledge measures crystallized intelligence, or the accumulated information and skills acquired through education and experience. Verbal knowledge tasks assess vocabulary and general information, while nonverbal knowledge tasks use pictures to assess understanding of concepts without requiring verbal responses.
⢠Quantitative Reasoning evaluates the ability to understand and reason with numerical concepts. Both verbal and nonverbal tasks assess mathematical reasoning, with verbal items presented as word problems and nonverbal items using visual displays of quantitative relationships.
⢠Visual-Spatial Processing measures the ability to perceive and manipulate visual patterns and spatial relationships. Tasks include assembling puzzles, identifying how shapes fit together, and mentally rotating objects.
⢠Working Memory assesses the ability to hold information in mind temporarily while manipulating or using it. Verbal working memory tasks involve remembering sequences of words or sentences, while nonverbal tasks require remembering sequences of visual stimuli or spatial locations.
This five-factor structure aligns with CHC theory, which has become the dominant framework in modern intelligence research. The dual verbal/nonverbal assessment of each factor allows the Stanford-Binet to identify patterns that might be missed by tests assessing only one modality.
How Is the Stanford-Binet Administered?
The Stanford-Binet requires individual administration by a trained examiner, typically a licensed psychologist or supervised trainee with graduate-level training in psychological assessment. Administration time varies considerably depending on the examinee's age and ability level, ranging from approximately 45 minutes for young children or individuals with intellectual disabilities to over two hours for gifted adults completing the full battery.
The test uses a routing procedure to identify the appropriate starting point for each examinee. Two initial subtests, one verbal and one nonverbal, establish the examinee's approximate ability level and determine which items to administer on subsequent subtests. This adaptive approach ensures that examinees receive items appropriate to their ability level rather than spending time on items that are far too easy or difficult.
Examiners follow standardized procedures detailed in the test manual, presenting items verbally, visually, or through hands-on manipulation of materials depending on the subtest. Some tasks are timed while others are untimed. The examiner records responses, scores them according to detailed criteria, and observes the examinee's approach to problem-solving throughout the session.
After administration, raw scores are converted to scaled scores using age-appropriate norms, and composite scores are calculated at multiple levels: subtest scores, factor scores, domain scores (verbal and nonverbal), and the Full Scale IQ representing overall cognitive ability.
What Scores Does the Stanford-Binet Provide?
The Stanford-Binet produces a comprehensive array of scores allowing interpretation at multiple levels of specificity. The Full Scale IQ (FSIQ) summarizes overall cognitive ability across all subtests, providing a global measure comparable to other major IQ tests. Like other modern tests, the Stanford-Binet uses deviation IQ scoring with a mean of 100 and a standard deviation of 15.
Below the Full Scale IQ, the test provides two domain scores: the Verbal IQ, summarizing performance across all verbal subtests, and the Nonverbal IQ, summarizing nonverbal performance. Comparing these scores can reveal modality-specific strengths or weaknesses. Someone might demonstrate strong verbal abilities alongside weaker nonverbal skills, or vice versa.
Five-factor index scores correspond to the cognitive factors described above: Fluid Reasoning, Knowledge, Quantitative Reasoning, Visual-Spatial Processing, and Working Memory. These scores allow more specific analysis of cognitive strengths and weaknesses than the global IQ alone. Finally, ten individual subtest scores (five verbal and five nonverbal) provide the most granular level of analysis, though interpretation at this level requires caution due to lower reliability compared to composite scores.
This hierarchical scoring structure, from specific subtests up through factors and domains to the global IQ, aligns with the hierarchical organization of cognitive abilities described in CHC theory and allows clinicians to examine cognitive functioning at whatever level of detail the referral question requires. Our article on interpreting IQ scores explains how these different score levels are used.
Who Takes the Stanford-Binet?
The Stanford-Binet's age range of 2 to 85+ makes it one of the most versatile IQ tests available. This broad coverage allows the same test to assess cognitive development from early childhood through late adulthood, which is particularly useful for longitudinal research or clinical situations requiring assessment across the lifespan.
In clinical settings, psychologists use the Stanford-Binet to diagnose intellectual disabilities, identify giftedness, evaluate cognitive effects of neurological conditions, and assess changes in cognitive functioning over time. The test's ability to measure very low and very high ability levels makes it particularly useful at the extremes of the distribution, specifically for assessing individuals with significant intellectual disabilities or identifying profoundly gifted children.
Educational settings use the Stanford-Binet for gifted program identification, learning disability evaluation, and educational planning. Many school districts accept Stanford-Binet scores for gifted program placement. The minimum IQ score to qualify for gifted programs varies, but it is typically between 120 and 130 (i.e., the top 2-10% of the population). Â
How Does the Stanford-Binet Compare to Other IQ Tests?
The Stanford-Binet and the Wechsler scales are the two most widely used individually administered IQ tests, and both have strong psychometric properties supported by decades of research. Scores from the two tests correlate highly, typically around .80 to .85, indicating they measure the same underlying construct despite differences in structure and specific content.
The primary structural difference is age coverage. The Stanford-Binet uses a single test across all ages, while the Wechsler system uses three separate tests (WPPSI, WISC, WAIS) for different developmental periods. The Stanford-Binet's approach offers continuity but requires careful norming across a very wide age range. The Wechsler approach allows each test to be optimized for its specific age group but requires transitions between tests as children age.
The Stanford-Binet's five-factor structure with verbal/nonverbal divisions differs from the Wechsler index structure, producing somewhat different score profiles. Clinicians may prefer one or the other depending on the referral question, examinee characteristics, or their own training and experience.
Both tests require individual administration by qualified professionals, making them comparably expensive and time-intensive. For individuals seeking cognitive assessment without clinical evaluation, professionally developed online tests offer an accessible alternative. TheReasoning and Intelligence Online Test (RIOT) measures similar cognitive domains (verbal reasoning, fluid reasoning, spatial ability, working memory, and processing speed) using task formats supported by research.
Developed by Dr. Russell T. Warne with over 15 years of experience in intelligence research, the RIOT meets professional standards established by the American Educational Research Association, American Psychological Association, and the National Council on Measurement in Education. It provides a representative U.S. norm sample, confidence intervals acknowledging measurement precision, and detailed cognitive profiles.Â
Watch âHow Intelligence and IQ Work in the Brainâ with Dr. Richard Haier on the Riot IQ YouTube channel to see how the Stanford-Binetâs structure connects to underlying cognitive processes.