The execution of an intellectually disabled individual violates the Eighth Amendment's prohibition on cruel and unusual punishment, a categorical rule established by the United States Supreme Court in Atkins v. Virginia (2002). However, the operational mechanism for defining and verifying intellectual disability was delegated to individual state jurisdictions. This structural fragmentation created an immediate systemic vulnerability: states seeking to preserve capital sentences implemented rigid, unscientific metrics to bypass the constitutional exemption.
The boundaries of this legal and clinical friction were highlighted in Hamm v. Smith (2026), where the Supreme Court dismissed Alabama’s challenge to a lower court finding that barred the execution of death row inmate Joseph Clifton Smith. By dismissing the case as "improvidently granted," a 5-4 majority preserved the lower courts' reliance on holistic clinical standards rather than a strict numerical cutoff. The decision exposes a deeper systemic vulnerability within post-conviction frameworks: the reliance on binary, static metrics to evaluate complex, continuous biological and psychological traits.
The Tri-Partite Function of Clinical Diagnosis
To understand the breakdown in state-level enforcement, the legal system must be mapped against the established diagnostic criteria used by the medical and psychological communities. Under the American Association on Intellectual and Developmental Disabilities (AAIDD) and the Diagnostic and Statistical Manual of Mental Disorders (DSM-5), an intellectual disability is not an isolated score; it is a tri-partite construct.
+-----------------------------------------------------------------------+
| Intellectual Disability |
+-----------------------------------------------------------------------+
|
+--------------------------+--------------------------+
| | |
v v v
+------------------+ +------------------+ +------------------+
| Component 1 | | Component 2 | | Component 3 |
| | | | | |
| Intellectual | | Adaptive | | Developmental |
| Functioning | | Deficits | | Onset |
| (IQ Scores) | | (Conceptual, | | (Manifestation |
| | | Social, Practical| | Before Age 18) |
| | | Skills) | | |
+------------------+ +------------------+ +------------------+
- Intellectual Functioning Deficiency: Significantly subaverage intellectual functioning, typically operationalized as an Intelligence Quotient (IQ) score approximately two standard deviations below the population mean (a score of 70, given a mean of 100 and a standard deviation of 15).
- Adaptive Deficits: Significant limitations in concurrent adaptive functioning across at least one of three domains: conceptual (literacy, money management), social (interpersonal communication, social judgment), and practical (personal care, occupational tasks).
- Developmental Onset: Evidence that these intellectual and adaptive deficits manifested during the developmental period, universally defined as occurring before the age of 18.
The legal bottleneck occurs almost exclusively within the first component. States frequently treat the standard threshold of 70 as an absolute mathematical ceiling. If an individual scores a 71 or 72, the state’s legal architecture triggers a hard exit condition, precluding any evaluation of the remaining two clinical components. This structural design relies on a fundamental misunderstanding of psychometric measurement error.
The Standard Error of Measurement as a Statistical Limit
The core error in state capital review frameworks is the treatment of an IQ score as a deterministic value rather than a statistical range. Psychometric evaluations are subject to the Standard Error of Measurement ($SEM$), which accounts for variance introduced by testing environments, physiological state, and inherent test design limitations.
The relationship between an observed score and an individual's true intellectual capacity is defined by the formula:
$$True\ Score = Observed\ Score \pm (z \times SEM)$$
In standard psychometric testing, the $SEM$ is approximately 3 to 5 points. A 95% confidence interval ($z = 1.96$) dictates that an observed score of 72 yields a true score range between 67 and 77.
In Hamm v. Smith, Joseph Clifton Smith produced five distinct IQ test scores ranging from 72 to 78. Alabama argued that because these scores sat above the nominal 70-point threshold, Smith failed to meet the threshold requirement for intellectual disability.
The lower courts applied the $SEM$ inversely to capture the lower bound of the confidence interval, identifying that Smith’s true score could mathematically drop to 69. This statistical overlap satisfies the first component of the diagnostic criteria, forcing the court to advance to the second phase: a qualitative assessment of adaptive functioning. The record in Smith's case confirmed profound, documented deficits from an early age in social interaction, independent living, and academic performance, satisfying the holistic diagnostic standard.
The State Optimization Strategy: Score Fishing and Adaptive Distortions
State capital architectures employ specific procedural mechanics to maximize the likelihood of an executable diagnosis. This creates a clear cause-and-effect loop that circumvents the spirit of Atkins:
+---------------------------+ +---------------------------+
| State Strategy: | ---> | Psychometric Consequence: |
| Repeated Psychometric | | Selection Bias via |
| Testing over Time | | Outlier Exploitation |
+---------------------------+ +---------------------------+
|
v
+---------------------------+ +---------------------------+
| Judicial Consequence: | <--- | State Strategy: |
| Erroneous Execution of | | Use Peak Score as |
| Eligible Individuals | | Deterministic Ceiling |
+---------------------------+ +---------------------------+
The execution of Edward Busby in Texas in May 2026—the state's 600th execution in the modern era—demonstrates the operational consequences of this distortion. Despite explicit consensus from both state and defense experts that Busby exhibited clear intellectual disability, structural hurdles and unscientific assessment protocols allowed the capital sentence to stand.
Historically, Texas optimized its denial rate by using the "Briseño factors," a non-clinical set of subjective questions designed to evaluate adaptive behavior based on lay stereotypes rather than psychiatric science (e.g., "Can the individual formulate plans?"). While the Supreme Court rejected this specific framework in Moore v. Texas (2017), states continue to exploit alternative psychometric loopholes.
One prominent strategy is the exploitation of regression to the mean via repeated testing. Because IQ scores naturally fluctuate, administering multiple tests increases the statistical probability of producing an outlier score that exceeds 70. By treating the peak score as a definitive ceiling rather than evaluating the mean or the cumulative distribution of all test results, a state can legally declare a defendant ineligible for protection. This protocol introduces a massive selection bias, leveraging psychometric volatility to achieve a specific legal outcome.
The Jurisprudential Bottleneck: Procedural Default vs. Substantive Merits
The structural division within the Supreme Court in Hamm v. Smith reflects a deep ideological split regarding federal court oversight of state criminal procedures. The 5-4 dismissal, led by the court’s centrist and liberal wings, left the 11th U.S. Circuit Court of Appeals’ holistic framework intact. This model mandates that multiple IQ scores must be analyzed in parallel with adaptive functioning evidence rather than serving as an isolated, exclusionary filter.
In contrast, the dissenting opinion, supported by Justices Thomas, Alito, Roberts, and Gorsuch, focused heavily on federalism and procedural preservation. The dissent argued that federal courts lack the specialized competence to dictate the precise weight assigned to multiple psychometric data points. Furthermore, the dissent highlighted a procedural bottleneck: the specific methodology for aggregating multiple IQ scores had not been thoroughly litigated in the lower state courts before arriving on the federal docket.
This tension reveals a fundamental operational flaw in the post-conviction process. Claims of intellectual disability are routinely denied based on procedural default—such as missing strict state-mandated filing windows or failing to object to a specific psychometric methodology during the initial trial phase—rather than an evaluation of the substantive merits of the clinical diagnosis. Consequently, an individual who meets all clinical criteria for intellectual disability can still be legally executed if their defense counsel fails to navigate the complex procedural web.
Systemic Risk Allocation and Strategic Recommendations
The current framework for evaluating intellectual disability in capital cases creates an asymmetric distribution of risk, favoring state finality over constitutional compliance. The lack of a unified national standard means that a defendant's constitutional exemption depends almost entirely on the state in which the crime was committed. To mitigate these structural failures and realign the legal process with accepted clinical science, the following structural changes are required:
1. Mandatory Mean Aggregation of Psychometric Data
Courts must discard the practice of isolating peak IQ scores to deny claims. When a defendant has undergone multiple psychometric evaluations, the judicial standard must require a weighted mean of all scores administered using validated instruments, adjusted for the Flynn Effect (the documented inflation of IQ scores over time) and the specific $SEM$ of each test version.
2. De-coupling the Diagnostic Order of Operations
The current linear bottleneck—where a failed IQ score instantly terminates the inquiry—must be replaced with a concurrent evaluation system. Courts must be legally required to evaluate adaptive functioning deficits and developmental onset regardless of whether the observed IQ score falls slightly above the nominal threshold of 70. Strong evidence of profound adaptive deficits must be capable of overriding psychometric ambiguity near the borderzone.
3. Federal Standardization of Forensic Evaluations
Because state-level discretion has produced a fragmented system, federal courts must enforce a strict baseline requirement: any state mechanism used to determine intellectual disability must explicitly align with the current editions of either the DSM or the AAIDD manual. Any deviation—such as the application of lay stereotypes or non-clinical behavioral assessments—must be treated as a per se violation of the Eighth Amendment.
The dismissal in Hamm v. Smith prevents the immediate erosion of federal protections for capital defendants with borderline intellectual functioning. However, it does not resolve the underlying structural defect. As long as the legal system permits the use of rigid numerical cutoffs to evaluate continuous, error-prone biological data, the operational execution of Atkins v. Virginia will remain fundamentally broken.