How We Test Mattresses: Our Review Methodology Explained
Mattress testing is more procedurally dense than it looks from the outside — and the gap between rigorous evaluation and a weekend's worth of impressions is wide enough to drive a delivery truck through. This page documents the full methodology behind the reviews published on Mattress Review Authority: what gets measured, how measurements are taken, which variables are controlled, and where judgment calls are unavoidable. Knowing the method is the only way to know what a rating actually means.
- Definition and Scope
- Core Mechanics or Structure
- Causal Relationships or Drivers
- Classification Boundaries
- Tradeoffs and Tensions
- Common Misconceptions
- Evaluation Checklist
- Reference Table: Testing Criteria Matrix
Definition and Scope
A mattress testing methodology is a structured protocol for converting physical and perceptual properties of a sleep surface into reproducible, comparable scores. The operative word is reproducible — a methodology only earns that name if a different reviewer, following the same steps, would arrive at a score within a defined margin of the original.
The scope here covers mattresses sold to the US residential market across five construction types: innerspring, memory foam, latex, hybrid, and airbeds. Each type presents distinct measurement challenges. An innerspring mattress behaves differently under body-weight pressure than a memory foam mattress does — coil count and ILD (Indentation Load Deflection) ratings pull the evaluation in different directions. A latex mattress bounces back in under 1 second; a slow-response foam can take 3–5 seconds. These differences aren't better or worse on their own — they're variables that interact with the sleeper's position, weight, and temperature.
The methodology applies to mattresses in the full US standard size range (Twin through Split California King) and does not evaluate mattresses designed for infants, cribs, or clinical/hospital use, which are governed by separate CPSC and ASTM standards.
Core Mechanics or Structure
Testing is organized into 8 scored dimensions. Each dimension carries a defined weight in the composite score, and no dimension is dropped from a review regardless of mattress type.
1. Pressure Relief (20% of composite score)
Measured using a pressure-mapping mat — a grid of sensors that records force distribution across the sleep surface. Test positions include side-lying at the shoulder and hip, and back-lying at the lumbar region. The Consumer Product Safety Commission does not set a standard for residential mattress pressure mapping, so the reference baseline is the Tekscan Body Pressure Measurement System protocol used in published sleep science research.
2. Motion Isolation (15%)
A standardized 10-pound steel ball is dropped from a height of 6 inches at one edge of the mattress while a calibrated accelerometer records vibration amplitude at the center. Lower amplitude = better isolation. The motion isolation dimension carries particular weight in couple-use evaluations.
3. Edge Support (10%)
A 75-pound cylindrical weight is placed 4 inches from the mattress perimeter. Deflection depth in millimeters is recorded after 60 seconds. Edge support failures — where the sleeping surface compresses more than 40mm under load — are flagged separately.
4. Temperature Regulation (15%)
A thermal probe records surface temperature after a standardized foam torso-form (simulating average adult body heat output, approximately 80–100 watts of metabolic heat during sleep per University of California, Berkeley sleep lab protocols) rests on the surface for 8 hours. Delta-T between ambient room temperature (held at 68°F) and mattress surface is the primary metric.
5. Firmness Accuracy (10%)
Manufacturer-stated firmness on the traditional 1–10 scale is compared against ILD measurement. Discrepancies of more than 1.5 points between stated and measured firmness are flagged. See the mattress firmness levels reference for full ILD-to-firmness correlations.
6. Durability Proxy (15%)
A mechanical roller — 220 pounds of applied force across a 4-inch cylinder — completes 30,000 passes across the mattress surface, simulating approximately 10 years of average use based on the International Sleep Products Association (ISPA) durability testing guidelines. Sag depth and foam recovery are measured before and after.
7. Off-Gassing (5%)
VOC (volatile organic compound) levels are recorded using an air quality sensor at mattress surface level for the first 72 hours after unboxing. Certifications like OEKO-TEX Standard 100 and CertiPUR-US are documented but not used as substitutes for direct measurement. The off-gassing and certifications page covers the full certification landscape.
8. Value Assessment (10%)
Price per square inch of sleep surface is calculated and benchmarked against the mattress price tiers framework. A $1,200 queen at 4,800 square inches works out to $0.25/sq in — a figure that gets compared against the segment median.
Causal Relationships or Drivers
Scores in one dimension reliably influence scores in others — and understanding those relationships is what separates analysis from a list of numbers.
Memory foam density (measured in pounds per cubic foot, or PCF) drives both pressure relief and durability scores. A foam rated below 3.0 PCF tends to under-perform on the 30,000-pass roller test, showing sag depths above 1.5 inches — the threshold ISPA uses to define a worn mattress. High-density foams (4.0–5.0 PCF) retain structure but trap heat, which typically pushes temperature regulation scores down.
Coil gauge in innerspring and hybrid mattresses creates a similar tension: lower gauge (thicker wire, e.g., 12-gauge) improves edge support and durability but reduces conforming pressure relief. Higher gauge coils (15–16 gauge) compress more readily under shoulder or hip load — better pressure mapping scores, weaker edge support readings.
Hybrid mattresses exist precisely because of these tradeoffs — pairing a pocketed coil base with a comfort foam layer attempts to capture both mechanical support and pressure-mapping performance. Whether that tradeoff resolves cleanly or creates new problems (particularly in heat retention at the foam-coil interface) is a central question in every hybrid review.
Classification Boundaries
The methodology draws hard lines around what it does and does not evaluate.
In scope: Mattresses sold through US residential retail channels, both online direct-to-consumer and in-store, priced between $300 and $5,000 for a queen size. Mattress-in-a-box products follow the identical protocol as traditionally shipped mattresses — compression shipping introduces a documented re-expansion variable that is controlled by a mandatory 72-hour off-gassing and expansion period before any measurement begins.
Out of scope: Adjustable air mattresses with app-controlled firmness are evaluated using a separate airbed protocol documented in the airbed mattress review section. Mattress toppers are not reviewed under this protocol — the mattress topper vs. new mattress page addresses that distinction directly.
Tradeoffs and Tensions
The 8-dimension composite score compresses a genuinely complex object into a single number, which is useful and misleading in equal measure.
A mattress that scores 8.4 overall might rank 6.0 on temperature regulation and 9.5 on pressure relief. For a hot sleeper with joint pain, that profile is exactly wrong — the aggregate score obscures a dimension-specific failure. Dimension-level scores are always published alongside the composite for this reason.
Subjective firmness perception diverges from ILD measurement by a margin that researchers at the National Sleep Foundation have documented as significant: a mattress with an ILD of 28 (technically "medium") can feel firm to a 120-pound side sleeper and soft to a 220-pound back sleeper. The methodology attempts to control for this by noting the weight profile of the primary tester (held constant at 185 pounds for standardization) and flagging reviews where body-weight interaction would meaningfully change results.
There is also an inherent tension between mattress durability as measured by a proxy test and real-world longevity. The 30,000-pass roller approximates a decade of use but cannot simulate the compounding effects of moisture, variable weight, or temperature cycling.
Common Misconceptions
"A higher coil count always means a better mattress."
Coil count matters only within construction type. A 1,000-coil pocketed system behaves differently than a 1,000-coil Bonnell system. Above approximately 800 individual pocketed coils in a queen, additional coils produce diminishing returns in pressure mapping performance, according to published material from the Sleep Products Safety Council.
"CertiPUR-US certification means no off-gassing."
CertiPUR-US certification, administered by an independent not-for-profit, verifies that foam was manufactured without certain prohibited chemicals (including formaldehyde and heavy metal catalysts). It does not certify zero VOC emissions — certified foams still off-gas at measurable levels for 24–72 hours. The direct measurement in the methodology captures what certification documents cannot.
"Firmness is objective and consistent across brands."
Firmness labeling is entirely unregulated in the US residential mattress market. A mattress labeled "Medium-Firm" by one manufacturer may measure at ILD 22 (the industry baseline for "Medium") while another manufacturer's "Medium" measures at ILD 32. The mattress firmness levels page documents the full distribution.
Evaluation Checklist
The following steps are executed in sequence for every mattress under review. Steps cannot be reordered or skipped.
- Cross-reference against mattress review rating criteria rubric for final score assignment.
Reference Table: Testing Criteria Matrix
| Dimension | Weight | Primary Measurement Tool | Pass Threshold | Notes |
|---|---|---|---|---|
| Pressure Relief | 20% | Tekscan-protocol pressure map | ≤20 mmHg peak shoulder/hip | Position: side-lying |
| Motion Isolation | 15% | Calibrated accelerometer | ≤0.05g vibration amplitude | 10-lb ball, 6-inch drop |
| Edge Support | 10% | Cylindrical weight + depth gauge | ≤40mm deflection | 75-lb load, 4 in. from edge |
| Temperature Regulation | 15% | Thermal probe | ≤4°F surface delta-T | 8-hr contact at 68°F ambient |
| Firmness Accuracy | 10% | ILD indentation tool | ≤1.5 point stated/measured gap | 5-point surface grid |
| Durability Proxy | 15% | Mechanical roller | ≤1.5 in. sag post-simulation | 30,000 passes, 220-lb load |
| Off-Gassing | 5% | VOC air quality sensor | Below EPA Tier 1 residential VOC threshold | Measured at 1, 24, 48, 72 hrs |
| Value | 10% | Price-per-square-inch | Benchmarked to segment median | Queen baseline: 4,800 sq in |