Mattress Review Methodology: How We Test and Score Mattresses
Mattress testing sounds deceptively simple — lie on it, decide if it's comfortable — but the gap between a first impression and a durable, useful review is wider than most readers realize. This page explains the structured methodology behind the ratings and evaluations published on Mattress Review Authority: what gets measured, how measurements translate to scores, where judgment calls are inevitable, and where the methodology has honest limitations. The goal is transparency, not just credibility.
- Definition and scope
- Core mechanics or structure
- Causal relationships or drivers
- Classification boundaries
- Tradeoffs and tensions
- Common misconceptions
- Testing checklist
- Reference table: scoring dimensions
Definition and scope
A mattress review methodology is the defined set of physical tests, structured observations, and weighted scoring criteria applied consistently across models to produce comparable ratings. The word "consistently" is doing serious work in that sentence. Without it, a review of a $300 innerspring and a review of a $3,000 latex hybrid are essentially two separate species of opinion — interesting, perhaps, but not comparative.
The scope of a rigorous methodology covers 8 primary performance dimensions: pressure relief, motion isolation, edge support, temperature regulation, responsiveness, noise, durability indicators, and ease of movement across the surface. Each dimension maps to a specific body of sleeper need — side sleepers, for example, weight pressure relief more heavily than stomach sleepers do, which is why the full mattress firmness levels explained framework is a prerequisite for interpreting any single score.
Testing scope also distinguishes between objective physical measurement (deflection depth under load, ILD ratings for foam density) and structured subjective assessment (how a panel of testers at different body weights rates the feel of a surface). Both are necessary. Neither alone is sufficient.
Core mechanics or structure
The testing protocol operates in three phases: physical material assessment, panel-based sleep simulation, and long-term durability projection.
Phase 1 — Material assessment. Foam layers are evaluated against their advertised ILD (Indentation Load Deflection) values, which measure the force in pounds required to compress a 50-square-inch disc 25% into the foam. A standard comfort-layer foam typically lands between 10–20 ILD; support-layer foam runs 25–40 ILD (American Society for Testing and Materials, ASTM D3574). Coil gauges in innerspring and hybrid models are measured; lower gauge numbers (e.g., 13–14) indicate thicker, firmer wire, while higher gauges (e.g., 16–17) indicate softer, more conforming wire.
Phase 2 — Panel sleep simulation. A structured panel of testers covering three body-weight categories — under 130 lbs, 130–230 lbs, and over 230 lbs — assesses each mattress in back, side, and stomach positions. Each tester completes a standardized 15-point scoring rubric immediately after a minimum 20-minute rest period. Panel scores are averaged within weight categories before being combined into composite ratings, preventing the experience of a 280-lb back sleeper from drowning out the equally valid experience of a 115-lb side sleeper. The mattress pressure relief explained and mattress motion isolation explained pages detail how those specific sub-scores are derived.
Phase 3 — Durability projection. Accelerated wear simulation uses a roller mechanism to apply approximately 30,000 compression cycles — equivalent to roughly 10 years of nightly use — and measures sagging depth and foam recovery. The mattress durability and lifespan page documents the exact pass/fail thresholds used.
Causal relationships or drivers
Scores are not arbitrary — each one has a traceable mechanical cause. Edge support ratings, for example, are driven almost entirely by perimeter coil gauge in hybrid models and by foam density (measured in lbs per cubic foot) in all-foam models. A 1.8 lb/ft³ foam edge will compress measurably more than a 2.5 lb/ft³ edge under equivalent load. The mattress edge support explained page walks through exactly this relationship.
Temperature regulation scores are driven by material thermal conductivity, airflow architecture (open-cell vs. closed-cell foam, coil-layer ventilation in hybrids), and cover fabric breathability. Gel-infused memory foam does not perform equivalently to open-cell latex on thermal dissipation — a distinction that matters significantly to the approximately 20% of adults who report sleeping hot, according to the National Sleep Foundation's sleep health data.
Motion isolation is almost exclusively a function of material damping capacity. Latex isolates motion less effectively than dense memory foam because latex has higher elasticity — it springs back rather than absorbing force. A hybrid mattress with a thick foam comfort layer will isolate motion better than a coil-forward innerspring even at the same firmness level.
Classification boundaries
Not every mattress undergoes the same depth of testing. The methodology applies four classification tiers:
- Full protocol — in-lab physical testing plus panel assessment plus durability simulation. Applied to models with manufacturer-supplied samples or purchased units.
- Panel-only — structured tester assessment without independent material measurement. Applied when third-party lab data is available from the manufacturer but no physical sample was obtained.
- Specification review — scoring based entirely on manufacturer-disclosed specs, third-party certifications (OEKO-TEX, CertiPUR-US), and cross-referenced consumer data. Applied to models outside the primary test pool.
- Data-only provider — no score assigned; specifications and certifications documented without rating. The mattress off-gassing and certifications page explains how certification data is used in specification reviews.
Readers seeing a score should check the methodology tag on each review to understand which classification applies. A full-protocol score and a specification-review score are different instruments.
Tradeoffs and tensions
The honest tension in mattress testing is that comfort is partially irreducible to metrics. Two testers of identical weight sleeping in identical positions can legitimately disagree about whether a 28 ILD foam feels "supportive" or "hard." The methodology manages this through panel averaging, but averaging is a compression of information, not a resolution of disagreement.
A second tension: durability simulation is inherently a proxy. Rolling 30,000 compression cycles in a lab is not biologically identical to 10 years of a specific human body at a specific temperature in a specific climate. The correlation is supported by research from the Sleep Products Safety Council (SPSC), but it remains a model, not a guarantee.
Third: price cannot be a scoring variable, but price is deeply relevant to value. A mattress scoring 7.2 out of 10 at $500 represents a different consumer proposition than one scoring 7.4 at $2,800. The mattress price tiers explained page addresses the value-per-dollar framing separately from raw performance scores — deliberately, because conflating them would contaminate the performance data.
Common misconceptions
Misconception: higher coil count always means better quality. Coil count matters only within a given coil type. A 1,000-count individually wrapped pocketed coil system outperforms a 2,000-count Bonnell (open) coil system for motion isolation because coil architecture determines function more than quantity. The innerspring mattress review section addresses coil types in detail.
Misconception: CertiPUR-US certification means the mattress is safe for everyone. CertiPUR-US, administered by a nonprofit established by the polyurethane foam industry, certifies that foam meets specific thresholds for VOC emissions and excludes certain chemical categories. It does not certify overall mattress safety, fire resistance compliance, or suitability for people with chemical sensitivities. The program's scope is accurately described on the CertiPUR-US website.
Misconception: a mattress needs to "feel firm at first" to be durable. Initial firmness and long-term durability are driven by different material properties. A high-density memory foam at 5 lb/ft³ can feel quite soft while outlasting a 1.5 lb/ft³ foam that feels firm out of the box. The mattress break-in period page addresses the confusion between initial feel and structural integrity.
Misconception: online reviews reflect long-term performance. Most consumer reviews are submitted within the first 30–90 days of ownership. Sagging, support loss, and cover degradation typically emerge after 12–18 months. This is why accelerated durability simulation exists — the mattress trial periods and return policies page notes that trial windows are often shorter than the degradation timeline.
Testing checklist
The following elements are documented for every full-protocol review before a score is assigned:
- Foundation compatibility checked against manufacturer specifications (mattress foundation and base compatibility)
Reference table: scoring dimensions
| Dimension | Weight in Overall Score | Primary Measurement Method | Relevant Page |
|---|---|---|---|
| Pressure Relief | 22% | Panel rubric, foam ILD, body-weight-stratified scores | Pressure Relief |
| Support / Spinal Alignment | 20% | Panel rubric by sleep position, ILD stack analysis | Back Sleepers |
| Motion Isolation | 14% | Standardized weight-drop, accelerometer reading | Motion Isolation |
| Edge Support | 10% | Load deflection at perimeter, panel ingress/egress rating | Edge Support |
| Temperature Regulation | 12% | Thermal imaging, material conductivity data, panel heat rating | Hot Sleepers |
| Durability Indicators | 12% | Roller compression cycle results, foam density, coil gauge | Durability |
| Responsiveness / Ease of Movement | 6% | Panel rating, surface latex/foam elasticity | Combination Sleepers |
| Noise | 4% | Decibel measurement under standardized load-shift protocol | Innerspring Review |
Weights reflect the relative importance of each dimension across a general adult sleeper population. Specialty reviews — for side sleepers, heavy sleepers, couples — reweight the matrix to match population-specific priorities. The mattress review rating criteria page documents the full weighting rationale, and how to read a mattress review explains how to apply these scores to a specific situation.