RFP Template + Scoring Rubric for Email & SMS Agency Procurement

Why most messaging RFPs fail (and how to fix yours)

Email/SMS procurement went wrong when we treated retention like a creative service. We asked for dribbble shots and subject lines, not plumbing. The result: pretty decks, slow migrations, and KPIs that look busy while finance wonders where payback went. The fix is structural: write an RFP that evaluates systems—deliverability discipline, zero-downtime cutovers, incrementality you can show a CFO, and governance that prevents “we blasted the whole list because it was a big week.”

This guide is not a “tips & tricks” list. It’s an operator’s packet: a copy-ready RFP template, a weighted scoring rubric, interview scripts that smoke out red flags, and a pilot plan that forces reality in 45 days. Use it to select a partner who paints after the plumbing works.

The RFP template (copy/paste sections)

Below is a complete structure you can paste into your procurement doc or portal. Replace bracketed text. Keep the structure—even if you trim prose—because it maps to the scoring rubric and your IT/legal review.

1. Executive Summary

  • Context: Brief business overview, regions, brands, in-scope channels.
  • Objective: Select an agency to improve RPR, second-purchase rate, inbox placement/complaints, reduce discount reliance, and accelerate payback in 90 days.
  • Scope headline: Lifecycle (flows), campaigns, SMS, deliverability, migration (if any), experimentation, analytics, governance.
  • Timeline: RFP release, Q&A, demos, shortlisting, reference checks, award, 45-day pilot.

2. Current State & Stack

  • ESP/SMS/Push: <Klaviyo/Braze>, <Attentive/Postscript>, push <Y/N>
  • Commerce: Shopify (+ Markets), headless, subscriptions/loyalty systems
  • Data: warehouse/CDP (Snowflake/BigQuery/dbt? Reverse ETL Hightouch/Census?)
  • Regions/Languages: list locales; RTL needs
  • Deliverability posture: dedicated domain, DMARC, complaint trends

3. Business Outcomes & KPIs

  • Holdout-adjusted RPR (flows vs. campaigns), 30-day second-purchase rate, complaint/unsub by domain, discount reliance, payback.
  • Secondary: reorder interval, subscription save rate.
  • Reporting rhythm: weekly 10-minute readout; monthly retro with “what changed / learned / test next.”

4. Functional Requirements

4.1 Lifecycle & Campaigns: rebuild post-purchase & second-purchase flows (proof-first); orchestrate SMS as nudge; coverage grid by cohort.

4.2 Experimentation: message-level holdouts, uplift for incentives, bandits for framing; sample size & stopping rules.

4.3 Deliverability: dedicated domain + DMARC, engagement bands, sunset policy, complaint monitoring by domain, incident playbook.

4.4 Global: dynamic templates with language packs; RTL support; regional consent & quiet hours.

4.5 Migration (if applicable): parallel sends, warm-up plan, event mapping, go/no-go criteria, rollback.

5. Security & Compliance

  • SOC 2 posture or control mapping; DPA readiness; sub-processor list; data flow diagrams.
  • Access control & offboarding; secrets management; logging; incident response.
  • 10DLC registration and compliance; consent capture; HELP/STOP keywords; quiet-hours enforcement.
  • GDPR/CCPA: roles (processor vs. controller), subject rights handling, retention & deletion policy.

6. Team & Operating Model

  • Org chart: strategist, deliverability lead, producer, builders, analyst; coverage by timezone/locale.
  • RACI for deliverability, analytics, creative, ops, approvals.
  • SLAs: QA windows, incident response, change control, send freezes.

7. Pricing & Commercials

  • Pricing model: retainer/pod/outcome-based; what’s included, what’s not; change-order policy.
  • Hidden costs surfaced: migrations, reverse ETL, panel seeds, SMS brand/campaign fees.
  • Payment terms, contract length, termination, IP ownership, confidentiality, non-solicit.

8. Vendor Response Instructions

  • Single PDF: executive summary, case studies with holdout-adjusted metrics, security packet, staffing plan, SOW draft, 45-day pilot plan.
  • Mandatory attachments: anonymized “weekly 10-minute” report; deliverability SOP; incident post-mortem.
  • Demo requirements: live template with language pack; deliverability dashboard; holdout readout.

9. Evaluation & Timeline

  • Weights (see rubric): Outcomes 30%, Security 20%, Stack/Migration 20%, Ops/Governance 15%, Team/Pricing 15%.
  • Milestones: Q&A window, demo week, shortlist, references, award, pilot start.

10. Appendices

  • Data definitions (margin, attribution, complaint threshold)
  • Sample SOW clauses (QA SLA, incident response, change freeze)
  • Legal templates: DPA outline, 10DLC checklist
RFP Cover Letter Email (copy-ready)
Subject: RFP: Enterprise Email/SMS/Retention Agency – [Brand]

Hi [Vendor Team],

[Brand] is issuing an RFP to select an enterprise partner for email, SMS, and retention. 
Our 90-day success criteria: improve RPR (flows vs. campaigns), 30-day second-purchase rate, inbox placement/complaints, 
and discount reliance while maintaining payback. Stack: [ESP/SMS/CDP/Commerce]. Security: SOC2 posture/DPA/10DLC/consent.

Timeline:
• RFP Q&A: [dates]  • Demo window: [dates] • Shortlist: [date] • Award: [date] • Pilot start: [date].

Please confirm receipt and share a single-PDF response by [deadline], including:
(1) holdout-adjusted case studies, (2) security packet (data flows, access), (3) staffing plan,
(4) SOW draft with SLAs, (5) 45-day pilot plan.

Thanks,
[Name, Title]
[Procurement/Lifecycle]
      

The scoring rubric (weights, criteria, math)

Score on a 0–5 scale for each criterion, multiply by weight, and sum to 100. Require written evidence or a live demo for any 4–5 score. “Trust us” earns a 2.

Category Weight Criteria (0–5 each)
Outcomes & Incrementality 30% Holdout-adjusted RPR & P2 lift; payback impact; discount reliance trend; uplift tests for incentives; CFO-grade reporting
Security & Compliance 20% SOC2 posture/controls; DPA; data flows; access control; incident response; 10DLC registration/consent; GDPR/CCPA readiness
Stack Fit & Migration 20% Receipts in your ESP/SMS; parallel cutovers; warm-up curves; event/identity mapping; multilingual; warehouse/RTEL alignment
Ops & Governance 15% SLAs; QA checklist; change freeze; weekly 10-min readouts; RACI with named owners; coverage by timezone
Team & Pricing 15% Named senior team; capacity plan; clear scope & change-order policy; transparent fees; surfaced hidden costs

Rubric CSV (paste into Sheets)

Category,Weight,Criterion,Score (0-5),Weighted
Outcomes & Incrementality,0.30,Holdout-adjusted RPR/P2 lift,,
Outcomes & Incrementality,0.30,Payback impact,, 
Outcomes & Incrementality,0.30,Discount reliance trend,, 
Outcomes & Incrementality,0.30,Uplift tests for incentives,, 
Outcomes & Incrementality,0.30,CFO-grade reporting,, 
Security & Compliance,0.20,SOC2/DPA/Data flows,, 
Security & Compliance,0.20,Access control & incident response,, 
Security & Compliance,0.20,10DLC & consent controls,, 
Security & Compliance,0.20,GDPR/CCPA readiness,, 
Stack & Migration,0.20,ESP/SMS receipts,, 
Stack & Migration,0.20,Parallel cutover & warm-up,, 
Stack & Migration,0.20,Event/identity mapping,, 
Stack & Migration,0.20,Warehouse/RTEL alignment,, 
Ops & Governance,0.15,SLAs & QA checklists,, 
Ops & Governance,0.15,Change freeze & readouts,, 
Ops & Governance,0.15,RACI & timezone coverage,, 
Team & Pricing,0.15,Named team & capacity,, 
Team & Pricing,0.15,Scope & hidden costs surfaced,, 
TOTAL,,,
    

Scoring math

Total = Σ(score_i / 5 * weight_i), scaled to 100. Require ≥70 to shortlist; require ≥80 plus clean security to award. Tie-break on pilot plan clarity and reference strength.

Vendor questions that surface truth

  • Open a live deliverability dashboard and show complaint rate by domain for a recent launch. What did you change when Gmail ticked up?
  • Show a holdout readout where RPR improved and discount reliance fell. What changed in creative, cadence, or segmentation?
  • Walk us through a zero-downtime migration: warm-up curve, parallel sends, go/no-go criteria, and how you’d roll back.
  • Open a live template with a language pack/partials; invite a translator to change copy safely.
  • Share your incident post-mortem template; anonymize a real example.
  • Who has production access in client accounts? How do you offboard staff? Show the checklist.
  • What’s your change-freeze policy during peak weeks or placement incidents?
  • Share the 10-minute weekly readout and one decision it changed in the last quarter.

Interview & demo agenda (90 minutes that matter)

  1. 10 min – Outcomes: two holdout-adjusted case studies; CFO dials only.
  2. 15 min – Deliverability: live dashboard; complaint by domain; incident process.
  3. 15 min – Migration/Stack: mapping, warm-up, data checks; sample cutover plan.
  4. 15 min – Global: language packs, RTL demo, regional consent.
  5. 15 min – Ops: SLAs, QA checklists, RACI; weekly readout ritual.
  6. 10 min – Pilot plan: scope, success criteria, kill switch.
  7. 10 min – Q&A: open the floor.

Security & compliance packet (what IT needs)

Ask vendors to submit a single security brief with the following:

  • Data flow diagram (PII ingress/egress; encryption; residency)
  • SOC 2 status or control mapping; pen test summary; vulnerability management
  • DPA template; sub-processor list; retention/deletion policy
  • Access control (SSO, MFA, least-privilege, offboarding)
  • Incident response (detection, escalation, communication timeline)
  • 10DLC registrations; consent policies; HELP/STOP keyword handling; quiet-hours enforcement

Migration & cutover checks (zero downtime)

Migrations fail in two places: deliverability and data fidelity. Your RFP must measure both.

  1. Warm-up: dedicated domain + DMARC; engagement-band sends for 2–3 weeks; seed tests.
  2. Parallel: dual sends on critical flows; compare RPR/placement; complaint watch.
  3. Mapping: event names, identities, suppressions/preferences; recon checklist.
  4. Cutover: go/no-go criteria; rollback plan; post-cutover stabilization.

Deliverability & placement requirements

  • Dedicated domain, aligned DMARC/DKIM, tracking CNAMEs
  • Engagement banding; sunset policy; send-freeze rules
  • Complaint dashboards by domain; placement proxies (seed/panel, read-time)
  • Template standards: real text, alt text, AAA contrast; no image-only emails
  • Incident playbook with owners and timelines

Attribution & incrementality (proof finance accepts)

Require randomized controls on messages that justify spend (saves, recommendations, SMS nudges). Grade agencies on their discipline, not just their math.

  • Message-level holdouts (10–20%)—never removed during big weeks
  • Flow-level controls when changing structure
  • Uplift tests for incentives by risk band—only persuadables get perks
  • Report holdout-adjusted RPR, conversion, AOV, P2-rate, payback

Global & multilingual ops requirements

  • Language capture on profile; footer toggle; preference center
  • Template language packs/partials; translators edit keys, not logic
  • RTL support (dir="rtl" containers; mirrored icons; fonts)
  • Regional consent; quiet hours; regional deliverability history

Pricing & staffing models (and hidden costs)

Common models

  • Retainer: predictable fee for outputs; requires change-order discipline.
  • Pod: cross-functional team dedicated to you; higher velocity, higher cost.
  • Outcome-based: milestone/KPI-linked tranches; align incentives; needs clean baselines.

Hidden costs to surface: migrations and warm-up, reverse ETL, seed panels, SMS brand/campaign fees, additional experimentation traffic, translation ops.

SLAs, SOW & governance (prevent fire drills)

Your SOW is the brake pedal. Bake governance into it.

  • QA SLA: minimum review windows; rendering checks; links/UTMs; segmentation audits
  • Incident response: who pauses sends; who notifies; fix timelines; “no blame” post-mortems
  • Change control: approval steps; change logs; send freezes during risk windows
  • RACI: named owners for deliverability, analytics, creative, ops
  • Reporting rhythm: weekly 10-minute readout; monthly retro with decisions
Sample SOW clauses (excerpt)
QA & SEND SLA. Agency shall provide a minimum QA window of [X] business hours before any scheduled send, including device rendering, link/UTM validation, 
segmentation audits, and suppression checks. No changes may be made within [Y] minutes of send time.

INCIDENT RESPONSE. If complaint rate exceeds [0.08%] on Gmail or seed/panel indicates placement risk, Agency will pause promotional sends, 
notify Client within [60] minutes, and implement the deliverability incident playbook.

PERSISTENT HOLDOUTS. Agency will maintain 10–20% randomized holdouts on save/recommendation messages. Holdouts shall not be removed during “big weeks.”
      

The 45-day pilot: scope, success criteria, kill switch

A pilot forces reality. It protects both parties from three-month “strategy” phases that never touch revenue.

Scope (example)

  • Rebuild post-purchase & second-purchase flows (proof-first modules)
  • Deliverability task: domain warm-up or complaint remediation
  • One SMS nudge with quiet hours + Snooze
  • Holdouts on save/recommendation touches

Success criteria

  • +X% holdout-adjusted RPR on pilot messages
  • +Y pts in 30-day second-purchase rate for exposed cohort
  • Complaints ≤0.08% (Gmail); unsub ≤0.3% targeted sends; SMS opt-out steady or down
  • Discount reliance flat or down

Kill switch

Two strike conditions (e.g., missed SLAs twice, complaint spikes) auto-pause the pilot. If a vendor resists a kill switch, that’s your decision.

Procurement timeline & communications plan

Week Milestone Owner Notes
1 RFP release + brief Procurement Include Intent-Aligned Brief
2 Q&A window Procurement + Lifecycle + Security Publish clarifications
3 Responses due; rubric scoring Evaluation panel Use weighted table
4 Demos/interviews Evaluation panel 90-min script
5 Shortlist + reference checks Procurement Reference script
6 Award + SOW/DPA Legal + Procurement Add kill switch
7 Pilot kickoff Lifecycle + Vendor Weekly 10-min begins
Reference check script (copy-ready)
  1. What measurable change did they deliver in 90 days? (RPR, P2-rate, payback, complaints, discount reliance)
  2. Describe a migration or placement incident. How did they respond?
  3. Were SLAs honored? Did QA prevent avoidable mistakes?
  4. How transparent were they about experiments that failed?
  5. Would you hire them again? Why or why not?

Receipts vs. red flags checklist

Receipts (green lights)

  • Holdout-adjusted case studies; uplift tests for perks
  • Deliverability SOP; incident post-mortem
  • Live template with language pack; translator-safe process
  • Security brief: data flows, access control, DPA
  • Weekly 10-minute reports; decisions changed by data

Red flags

  • Open-rate worship; no incrementality
  • “Warm a domain in a week” promises
  • “We duplicate flows per language” with no governance
  • Pooled team with no named seniors; contractor black box
  • No QA checklist; “agile” used as a synonym for chaos

Downloadables & formats

  • RFP Template — DOCX/Google Doc (sections above)
  • Scoring Rubric — CSV/XLSX (table above)
  • Evaluation Checklist — PDF (security, deliverability, ops, global)
  • Pilot Plan Outline — DOCX/PDF
  • Security Brief Request — DOCX (data flows, sub-processors, access control)

FAQ

How many vendors should we invite?

Three to five is sane. More than five and you’ll drown in decks. If you need breadth, run a two-stage process: short Intent Brief → five vendors → full RFP for three.

What’s a reasonable demo requirement?

Live dashboards (deliverability, complaint by domain), a live template with language packs, and one holdout readout. Screenshots don’t earn 4–5 scores.

Should we require SOC 2 certification?

It helps, but many excellent boutiques map controls without formal certs. Ask for security briefs, DPAs, and control evidence; score accordingly.

What if finance wants hard ROI before award?

That’s what the 45-day pilot is for. Define success criteria up front and include a kill switch. Award the full SOW only after the pilot clears.

Do we need a CDP for this?

No. A warehouse + dbt + reverse ETL can carry most enterprise needs until you have a mature data team and clear CDP use-cases.

Closing: choose plumbing, then paint

The “best agency” is the one that treats email/SMS as infrastructure—deliverability, migration discipline, incrementality, and governance—then paints with proof. If your RFP measures those things, you’ll hire builders. If it measures deck gloss, you’ll buy headlines. Use this template, score with the rubric, run the pilot, and let receipts decide.

Back to blog