How to Build a 3PL Scorecard: 12 Metrics to Evaluate Your Fulfillment Partner

Your third-party logistics provider handles your inventory, packs your orders, and represents your brand in the final mile. If they mess up, your customer doesn't blame the warehouse: they blame you. One-star review, chargeback, lost customer.
Despite these stakes, most e-commerce businesses evaluate their 3PL based on a vague sense of "things seem to be going okay" or "we haven't had any major disasters lately." That's not a strategy. That's hoping.
A formal 3PL scorecard replaces hope with data. It gives you a structured, repeatable way to measure performance, spot deterioration early, and have productive conversations with your fulfillment partner about what needs to improve. This article walks through 12 specific metrics across four categories, shows you exactly how to calculate each one, and provides a scoring framework you can implement this quarter.
Why You Need a Formal Scorecard
There are three practical reasons to formalize your 3PL evaluation:
1. You catch problems before they become crises. A 3PL doesn't go from great to terrible overnight. Performance degrades gradually: accuracy slips from 99.5% to 98.7% over three months, ship times creep from 1.2 days to 1.8 days. Without a scorecard, you don't notice until customers start complaining.
2. You create accountability. When performance expectations are written down with specific numbers and reviewed quarterly, your 3PL team knows exactly what they're being measured on. This changes behavior. Vague expectations produce vague results.
3. You make better decisions about staying or switching. The decision to change 3PLs is expensive and disruptive. A scorecard gives you an objective basis for that decision instead of frustration-driven impulse. It also gives your current 3PL a fair chance to course-correct with specific, actionable feedback.
The 12 Metrics: Four Categories
The scorecard is organized into four categories: Accuracy, Speed, Cost, and Communication. Each category contains three metrics. This structure ensures you're evaluating the full picture, a 3PL that ships fast but inaccurately isn't a good partner, and neither is one that's accurate but unresponsive.
Category 1: Accuracy
Accuracy metrics measure whether your 3PL is sending the right stuff to the right people. This is the foundation. Speed and cost optimization are meaningless if 3% of your orders arrive wrong.
Metric 1: Order Accuracy Rate
This is the single most important metric on the scorecard. It measures the percentage of orders shipped with the correct items, correct quantities, and correct packaging.
Formula:
Order Accuracy Rate = (Total Orders Shipped - Orders with Errors) / Total Orders Shipped × 100
What counts as an error: Wrong item, wrong quantity, missing item, wrong size/color/variant, incorrect packaging (e.g., fragile item shipped without protection).
Benchmark: Industry average for 3PLs is around 97%. You should expect 99%+ from a competent provider. Best-in-class operations hit 99.8%.
Why it matters: Every inaccurate order costs you the original shipping, return shipping, replacement shipping, customer service time, and potential customer lifetime value loss. At scale, a 1% improvement in order accuracy can save tens of thousands of dollars annually.
Metric 2: Inventory Accuracy Rate
This measures how closely the 3PL's reported inventory counts match the actual physical inventory. Discrepancies here cause oversells, stockouts, and phantom inventory problems.
Formula:
Inventory Accuracy Rate = (Number of SKUs with Correct Count) / Total SKUs Audited × 100
Alternatively, you can measure at the unit level:
Unit-Level Accuracy = 1 - (Total Absolute Variance in Units / Total Units on Hand) × 100
Benchmark: 95% at the SKU level is the minimum acceptable threshold. Target 98%+. Best-in-class hits 99.5%+.
How to verify: Request cycle count reports monthly. Conduct your own spot audits quarterly if your contract allows it. Compare the 3PL's reported counts against your OMS or inventory management system's records, discrepancies reveal sync issues that need immediate attention.
Metric 3: Shipping Accuracy Rate
This measures whether the correct carrier, service level, and shipping address were used for each order. It's separate from order accuracy because the contents can be right while the shipping execution is wrong.
Formula:
Shipping Accuracy Rate = (Orders Shipped with Correct Carrier/Service/Address) / Total Orders Shipped × 100
What counts as an error: Wrong carrier used, wrong service level (ground instead of 2-day), wrong address on label, wrong package weight entered (affecting cost).
Benchmark: 99%+ is the expectation. Shipping errors are particularly costly because they often result in late deliveries, reshipments, and carrier surcharges.
Category 2: Speed
Speed metrics capture how quickly your 3PL processes orders and receives inventory. In 2026, delivery speed expectations continue to tighten, 72% of online shoppers expect delivery within 3 days or less according to recent consumer surveys.
Metric 4: Order Processing Time
The time between when an order is received by the 3PL and when it's packed with a shipping label, ready for carrier pickup.
Formula:
Order Processing Time = Timestamp of "Ready to Ship" - Timestamp of "Order Received"
Calculate as an average across all orders for the period. Also track the 95th percentile, the average can hide outliers.
Benchmark: Same-day processing for orders received before the cutoff time (usually 12:00-2:00 PM local). Next business day for orders received after cutoff. Average processing time should be under 4 hours for orders received before cutoff.
Red flag: If processing time is increasing month over month, your 3PL may be understaffed, handling more volume than their facility can support, or dealing with process issues.
Metric 5: Average Ship Time
The elapsed time from order placed (by the customer) to first carrier scan. This is different from processing time because it includes any delays between your system transmitting the order and the 3PL acknowledging receipt.
Formula:
Average Ship Time = Average of (First Carrier Scan Timestamp - Order Placed Timestamp)
Benchmark: Under 24 hours for standard domestic orders. Under 48 hours for complex or custom orders. If your average ship time exceeds 36 hours for standard orders, you're at a competitive disadvantage.
Why track both processing time and ship time: Processing time measures 3PL efficiency. Ship time measures the end-to-end experience from the customer's perspective. If processing time is 3 hours but ship time is 30 hours, you have a handoff problem between your order management system and the 3PL, orders are sitting in a queue before they even reach the warehouse.
Metric 6: Dock-to-Stock Time
How quickly the 3PL receives, inspects, and shelves your inbound inventory so it's available for fulfillment.
Formula:
Dock-to-Stock Time = Timestamp of "Available for Fulfillment" - Timestamp of "Received at Dock"
Benchmark: 24-48 hours for standard replenishment. Same-day for urgent/priority receipts (if your contract includes this).
Why it matters: Every day your inventory sits on a dock unreceived is a day you can't sell it. During peak season or after a stockout, dock-to-stock time directly impacts revenue. If you're waiting 5 days for new inventory to become available after it arrives, that's 5 days of potential stockouts or missed sales.
Category 3: Cost
Cost metrics ensure your 3PL relationship remains economically sound. The cheapest 3PL isn't the best, but you need to understand exactly what you're paying and whether costs are trending in the right direction.
Metric 7: Cost Per Order (CPO)
The all-in cost to fulfill a single order, including pick, pack, materials, and labor.
Formula:
Cost Per Order = Total Fulfillment Charges / Total Orders Shipped
Include all fees: pick fees, pack fees, material fees, insert fees, kitting fees, and any handling surcharges. Exclude outbound shipping costs (those are carrier charges, not 3PL performance).
Benchmark: This varies significantly by industry and order complexity. Single-item, standard-size orders typically cost $3-$6 to fulfill through a 3PL. Multi-item orders with special packaging can run $8-$15+. The important thing is tracking the trend. CPO should decrease as your volume increases, thanks to economies of scale.
Metric 8: Cost Per Unit Stored
Monthly storage cost divided by the average number of units stored.
Formula:
Cost Per Unit Stored = Total Monthly Storage Charges / Average Units in Storage
Some 3PLs charge by pallet, some by cubic foot, some by bin. Normalize to a per-unit cost for apples-to-apples comparison across periods and across providers.
Benchmark: Highly variable by product size. Small items (apparel, electronics accessories): $0.20-$0.50 per unit per month. Medium items: $0.50-$1.50. Oversized items: $2.00+.
Watch for: Seasonal spikes in storage costs, long-term storage fees that kick in after 90-180 days, and minimum storage charges that don't decrease even when inventory levels drop.
Metric 9: Damage and Shrinkage Rate
The percentage of inventory lost or damaged while in the 3PL's care.
Formula:
Damage/Shrinkage Rate = (Units Lost + Units Damaged) / Total Units Handled × 100
Benchmark: Below 0.5% is acceptable. Below 0.1% is excellent. Above 1% is a serious problem that needs immediate investigation.
Why it matters: This is often an invisible cost. If 0.5% of your inventory disappears or gets damaged every month, that's 6% annual shrinkage. On $1 million in inventory, that's $60,000 walking out the door.
Category 4: Communication
Communication metrics are the most overlooked category, but they're often the first indicator of a deteriorating relationship. A 3PL that stops communicating effectively is usually a 3PL with problems they're not telling you about.
Metric 10: Issue Response Time
How quickly your 3PL responds when you raise a problem or ask a question.
Formula:
Average Issue Response Time = Average of (First Response Timestamp - Issue Reported Timestamp)
Track this across communication channels (email, phone, Slack, support portal).
Benchmark: Under 2 hours during business hours for standard issues. Under 30 minutes for urgent issues (stockouts, shipping holds, system outages). If you're waiting 24+ hours for responses to routine questions, your account isn't getting enough attention.
Metric 11: Reporting Frequency and Quality
Does your 3PL proactively share performance data, or do you have to chase them for every number?
Scoring criteria:
- Weekly operational reports delivered on time: Yes (10 points) / No (0 points)
- Monthly performance summaries with trend analysis: Yes (10 points) / Partial (5 points) / No (0 points)
- Real-time dashboard access: Yes (10 points) / No (0 points)
- Proactive alerts for exceptions or issues: Yes (10 points) / No (0 points)
Benchmark: A score of 30+ out of 40 indicates a communicative partner. Below 20 indicates a transparency problem.
Metric 12: Integration Reliability
How consistently the data connection between your systems and the 3PL's systems performs. This includes order transmission, inventory updates, tracking number pushback, and return notifications.
Formula:
Integration Uptime = (Total Hours - Hours of Integration Downtime) / Total Hours × 100
Also track:
Data Sync Error Rate = Failed Sync Attempts / Total Sync Attempts × 100
Benchmark: 99.9% uptime for integrations. Sync error rate below 0.1%. If your integration drops daily or inventory counts regularly fail to sync, you're operating blind.
"The API-first approach let us build custom picking flows in days, not weeks.": David Vance, Tech Lead, Modish Home
A reliable API and strong integration matter enormously here. When your OMS connects directly to your 3PL's systems, you get real-time data flowing both directions: orders out, tracking numbers and inventory counts back. When that connection is unreliable, every other metric on this scorecard becomes harder to measure.
Scoring Methodology: Weighting Your Metrics
Not all 12 metrics carry equal weight. Here's a recommended weighting framework:
| Category | Weight | Rationale |
|---|---|---|
| Accuracy | 35% | Errors directly impact customers and create returns |
| Speed | 25% | Delivery speed drives customer satisfaction and competitiveness |
| Cost | 20% | Must be sustainable, but lowest priority if accuracy and speed are strong |
| Communication | 20% | Early warning system for all other categories |
Within each category, distribute the weight equally among the three metrics unless your business has specific priorities. For example, if you sell fragile goods, you might weight damage/shrinkage rate higher within the Cost category.
Converting Metrics to Scores
For each metric, assign a score from 1 to 5:
| Score | Meaning | Guideline |
|---|---|---|
| 5 | Excellent | Exceeds target consistently |
| 4 | Good | Meets target most periods |
| 3 | Acceptable | Meets minimum threshold |
| 2 | Below Standard | Misses target regularly |
| 1 | Unacceptable | Significantly below threshold |
Calculate the weighted total:
Weighted Score = Σ (Metric Score × Category Weight / Number of Metrics in Category)
Maximum possible score: 5.0
| Overall Score | Assessment |
|---|---|
| 4.5 - 5.0 | Outstanding partner, maintain and grow the relationship |
| 3.5 - 4.4 | Solid partner, address specific weak areas |
| 2.5 - 3.4 | Underperforming, formal improvement plan needed |
| Below 2.5 | Unacceptable, begin evaluating alternatives |
The Complete Scorecard Template
Here's a ready-to-use scorecard table. Fill it in quarterly.
| # | Metric | Category | Weight | Target | Actual | Score (1-5) | Weighted Score |
|---|---|---|---|---|---|---|---|
| 1 | Order Accuracy Rate | Accuracy | 11.7% | 99%+ | ___% | ___ | ___ |
| 2 | Inventory Accuracy Rate | Accuracy | 11.7% | 98%+ | ___% | ___ | ___ |
| 3 | Shipping Accuracy Rate | Accuracy | 11.7% | 99%+ | ___% | ___ | ___ |
| 4 | Order Processing Time | Speed | 8.3% | < 4 hrs | ___ hrs | ___ | ___ |
| 5 | Average Ship Time | Speed | 8.3% | < 24 hrs | ___ hrs | ___ | ___ |
| 6 | Dock-to-Stock Time | Speed | 8.3% | < 48 hrs | ___ hrs | ___ | ___ |
| 7 | Cost Per Order | Cost | 6.7% | $___* | $___ | ___ | ___ |
| 8 | Cost Per Unit Stored | Cost | 6.7% | $___* | $___ | ___ | ___ |
| 9 | Damage/Shrinkage Rate | Cost | 6.7% | < 0.5% | ___% | ___ | ___ |
| 10 | Issue Response Time | Communication | 6.7% | < 2 hrs | ___ hrs | ___ | ___ |
| 11 | Reporting Quality | Communication | 6.7% | 30+/40 | ___/40 | ___ | ___ |
| 12 | Integration Reliability | Communication | 6.7% | 99.9% | ___% | ___ | ___ |
| TOTAL | 100% | ___/5.0 |
*Cost targets vary by business, set these based on your contract terms and industry benchmarks.*
Red Flags That Signal It's Time to Switch 3PLs
A low scorecard number isn't automatically a reason to switch. But certain patterns should trigger a serious evaluation:
- Accuracy below 97% for two consecutive quarters. Everyone has a bad month. Two bad quarters is a systemic problem.
- Declining scores across multiple categories simultaneously. This usually means the 3PL has taken on too much volume or is experiencing management/staffing turnover.
- Repeated missed commitments on improvement plans. If they agree to fix problems and don't, trust is broken.
- Integration reliability below 99% with no improvement plan. If your systems can't talk to each other reliably, you're operating with incomplete data and manual workarounds.
- Cost increases without corresponding service improvements. Annual rate increases are normal. Rate increases alongside declining performance are unacceptable.
- Unresponsive account management. If it takes 48+ hours to get a response to a routine question, you're not a priority.
Before switching, give your current 3PL a formal 90-day improvement window with specific, measurable targets. Some providers genuinely improve when presented with clear data. If they don't, you've documented the basis for your decision and can transition with confidence.
Running Quarterly Business Reviews (QBRs)
The scorecard is strongest when it drives a structured quarterly conversation. Here's a QBR format that works:
Agenda (60-90 minutes):
- Scorecard review (20 min). Walk through each metric. Compare to last quarter. Identify trends.
- Root cause analysis on low scores (15 min). For any metric scoring 2 or below, the 3PL should present their root cause analysis and corrective action plan.
- Volume and capacity planning (10 min). Share your upcoming demand forecast. Discuss whether the 3PL has capacity for peak periods, new product launches, or channel expansion.
- Cost review (10 min). Review any billing discrepancies, discuss upcoming rate changes, and evaluate cost optimization opportunities.
- Technology and process improvements (10 min). Discuss integration upgrades, new capabilities, or process changes that could improve performance.
- Action items and next quarter targets (10 min). Document specific commitments with owners and deadlines.
Who should attend: Your operations lead, their account manager, and (for annual or low-score reviews) leadership from both sides.
The Role of Your OMS in Providing Scorecard Data
Here's a practical reality: most of the data for your 3PL scorecard doesn't come from the 3PL. It comes from your own systems, specifically your order management system.
Your OMS captures the timestamps, status transitions, and exception data that feed nearly every metric on this scorecard. Order accuracy? Your OMS logs customer complaints and returns with reason codes. Ship time? Your OMS records order placement time and carrier scan time. Integration reliability? Your OMS tracks sync successes and failures.
This is why centralized order management matters even if you outsource fulfillment. You need an independent source of truth that isn't controlled by the party being evaluated. If you're relying solely on your 3PL's self-reported data to fill out the scorecard, you're grading their homework.
A system that captures data across all your sales channels, warehouses, and fulfillment partners, and normalizes it into consistent metrics, turns 3PL evaluation from guesswork into a data-driven process. That independence is what makes the scorecard trustworthy.
Putting It Into Practice
Building a 3PL scorecard isn't a weekend project. Here's a realistic timeline:
Week 1: Adopt the 12 metrics from this article. Customize targets based on your contract SLAs and business requirements.
Week 2-3: Audit your current data sources. For each metric, identify where the data lives and whether you can extract it reliably. You'll likely find gaps, that's normal.
Week 4: Fill out the scorecard for the first time using the best data you have. This first pass won't be perfect, and that's fine. It establishes a baseline.
End of Month 2: Share the scorecard with your 3PL. Frame it as a tool for mutual improvement, not a gotcha. Good 3PL partners welcome structured feedback.
Quarterly: Run the QBR. Update the scorecard. Track trends. Adjust targets as your business evolves.
The Bottom Line
Your 3PL relationship is too important and too expensive to manage on gut feel. A formal scorecard with 12 measurable metrics across accuracy, speed, cost, and communication gives you the clarity to hold your partner accountable, catch problems early, and make objective decisions about the most critical vendor relationship in your supply chain.
The 3PLs that welcome this kind of scrutiny are the ones worth keeping. The ones that resist it are telling you something.
Start with the data you have. Build the discipline of quarterly measurement. Let the numbers guide the conversation. Your customers, who experience the results of your 3PL's performance with every delivery, will notice the difference.
Frequently Asked Questions
Use a scorecard with 12 metrics across Accuracy, Speed, Cost, and Communication categories.
Industry average is 97%. Expect 99%+ from a competent provider. Best-in-class hits 99.8%.
Formal QBRs quarterly. Track metrics weekly for internal awareness.
Related Articles
View all
The Merchant Watchlist: 21 Signals to Track Before They Hit Your Margins
Most ecommerce sellers react after costs move. This watchlist shows what merchants should track weekly across freight, tariffs, fuel, suppliers, platforms, demand, and cash.

Amazon Delivery Is Now Measured in Minutes. Can Small Stores Compete?
Amazon is pushing delivery from days to hours to minutes. Small ecommerce brands cannot copy that network, but they can compete with clarity, trust, and smarter promises.

Unified Commerce Is the New Omnichannel, and Most Brands Are Still Behind
Omnichannel is no longer enough if every system still runs in silos. Unified commerce is becoming the operating standard for modern ecommerce.