How We Rate GLP-1 Programs
Every provider review uses the same five-part scoring framework. Here's exactly how we calculate scores — and why we weight each criterion the way we do.
Scoring Scale: 0–10 points per criterion (10 = best in class)
Scoring Method: Weighted average of 5 criteria = Overall Score
Update Frequency: Scores are recalculated when a provider makes material changes to pricing, medications, coaching, or program structure.
Independence: No provider can purchase a higher score. Methodology is fixed and publicly disclosed.
The Five Scoring Criteria
| Criterion | Weight | What We Evaluate |
|---|---|---|
| Clinical Effectiveness | 30% | Published outcomes data; FDA approval status; clinical trial evidence for prescribed medications; provider-reported average weight loss; duration of studied outcomes |
| Cost & Transparency | 25% | All-in monthly cost including medication; hidden fees; pricing clarity on website; refund and cancellation policies; cost difference with vs. without insurance |
| Program Comprehensiveness | 20% | Behavioral health coaching; dietary guidance; lab work; medication breadth; care team qualifications; follow-up visit structure; app/platform quality |
| Accessibility & Convenience | 15% | State availability; telehealth platform experience; prescription turnaround time; async vs. synchronous consultation options; medication delivery logistics |
| Customer Satisfaction | 10% | Verified patient reviews on Trustpilot, BBB, Google, and App Store; BBB rating and complaint volume; refund dispute patterns; social media sentiment |
Weights reflect the relative importance of each criterion to real-world patient outcomes and value. Clinical effectiveness carries the highest weight because the primary purpose of these programs is medically supervised weight loss — and programs that produce better, documented outcomes deserve higher scores regardless of price.
How We Calculate the Overall Score
Each criterion is scored from 0 to 10 by a combination of quantitative data and qualitative assessment. Quantitative inputs include published outcomes data, verified pricing, state availability counts, and third-party review ratings. Qualitative inputs are scored using benchmarks established across all providers we've reviewed — for example, 'Program Comprehensiveness' uses Calibrate's year-long behavioral curriculum as a 9.5 benchmark and Hims & Hers' coaching-free model as a 4.0 benchmark.
| Score | Classification |
|---|---|
| 8.0–10.0 | Editors' Choice — Exceptional program |
| 7.0–7.9 | Recommended — Strong program with minor limitations |
| 6.0–6.9 | Acceptable — Viable for some patients; notable limitations |
| 5.0–5.9 | Below Average — Significant drawbacks; limited use cases |
| Under 5.0 | Not Recommended — Fundamental program deficiencies |
What We Don't Score
The following factors are explicitly excluded from our scoring to protect editorial integrity:
- Marketing spend or brand visibility
- Whether a provider has an affiliate relationship with GLP1Authority
- Number of patients served (scale alone doesn't indicate quality)
- Advertising impressions or paid placement
- Provider's willingness to cooperate with our review process
Our Data Sources
Primary Sources
- Provider websites and published pricing
- FDA drug shortage database
- Published clinical trial data (STEP, SURMOUNT, SCALE trials)
Third-Party Reviews
- Trustpilot, BBB, App Store reviews
- Google Reviews
- Better Business Bureau complaint data
Community Feedback
- Provider-published outcomes data
- Direct program testing
- Reddit and patient forums
How Often We Update Reviews
We monitor all reviewed programs for material changes. Reviews are updated when a provider changes pricing by more than 10%, adds or removes medications, changes their coaching structure, or makes announcements that affect program quality or safety. All reviews display a 'Last Reviewed' date.
Major changes — such as the Hims & Hers oral semaglutide discontinuation in February 2026 — trigger immediate review updates and are noted prominently in the affected review.