System Usability Scale: 10 Powerful Insights You Must Know
Ever wondered how users truly feel about a product’s ease of use? The System Usability Scale (SUS) is a simple yet powerful tool that cuts through the noise, delivering reliable insights into user experience—fast, affordable, and effective.
What Is the System Usability Scale (SUS)?

Developed in the late 1980s by John Brooke at Digital Equipment Corporation, the System Usability Scale has become one of the most widely adopted usability assessment tools across industries. It’s a quick, reliable questionnaire designed to measure the perceived usability of a system, whether it’s a website, app, software, or even a physical device with a digital interface.
The brilliance of the SUS lies in its simplicity. It consists of just 10 statements, each rated on a five-point Likert scale ranging from ‘Strongly Disagree’ to ‘Strongly Agree’. Despite its brevity, it delivers a robust quantitative score that reflects overall usability. This makes it ideal for both formative and summative evaluations in user experience (UX) research.
Origins and Development of the SUS
The SUS was first introduced in 1986 as part of usability research at Digital Equipment Corporation. At the time, there was a growing need for a standardized, easy-to-administer method to evaluate how usable systems were from the user’s perspective. Traditional usability testing was often time-consuming and expensive, requiring observation, task completion metrics, and expert analysis.
Brooke’s goal was to create a subjective but reliable tool that could be used across different types of systems and user groups. The result was a 10-item questionnaire that could be completed in under 10 minutes. Over the decades, the SUS has been validated across countless studies and is now considered a gold standard in usability measurement.
system usability scale – System usability scale menjadi aspek penting yang dibahas di sini.
One of the key reasons for its longevity is its agnosticism—it doesn’t favor any specific technology or platform. Whether you’re testing a mobile banking app or a medical device interface, the SUS applies equally well. You can read more about its original development in the original research paper by John Brooke.
Structure and Scoring of the SUS
The SUS questionnaire contains 10 statements, alternating between positive and negative phrasing to reduce response bias. For example:
- I think that I would like to use this system frequently. (Positive)
- I found the system unnecessarily complex. (Negative)
- I thought the system was easy to use. (Positive)
Respondents rate each statement on a five-point scale: 1 = Strongly Disagree, 2 = Disagree, 3 = Neutral, 4 = Agree, 5 = Strongly Agree.
Scoring follows a specific formula:
- For odd-numbered items: Subtract 1 from the user’s response (so a score of 5 becomes 4).
- For even-numbered items: Subtract the user’s response from 5 (so a score of 1 becomes 4).
- Sum all the converted scores and multiply by 2.5 to get the final SUS score, which ranges from 0 to 100.
For example, if a user gives all ‘Strongly Agree’ responses (5s), their raw sum after conversion would be 40, multiplied by 2.5 = 100. A perfect score. Conversely, all ‘Strongly Disagree’ responses yield 0.
system usability scale – System usability scale menjadi aspek penting yang dibahas di sini.
“The SUS is not just a usability metric—it’s a usability benchmark.” — Jeff Sauro, MeasuringU
Why the SUS Is Universally Trusted
Despite its age, the SUS remains relevant because it’s been rigorously tested and validated. Studies have shown high internal consistency (Cronbach’s alpha typically above 0.9), meaning the items in the scale are reliably measuring the same construct: perceived usability.
It’s also been normed across thousands of studies, allowing researchers to compare scores against industry benchmarks. For instance, a score above 68 is considered above average, while anything over 80 is excellent. This comparative power is rare in subjective UX tools.
Moreover, the SUS is language-neutral and culturally adaptable. It’s been translated into over 30 languages and used in diverse global contexts—from evaluating e-government portals in Scandinavia to assessing telehealth apps in Southeast Asia.
How the System Usability Scale Compares to Other Usability Metrics
While the SUS is a staple in UX research, it’s not the only tool available. Understanding how it stacks up against alternatives helps clarify its unique value and appropriate use cases.
Other common usability metrics include the User Experience Questionnaire (UEQ), the Net Promoter Score (NPS), and task-based metrics like success rate, time-on-task, and error rate. Each has strengths, but the SUS stands out for its balance of speed, reliability, and depth.
system usability scale – System usability scale menjadi aspek penting yang dibahas di sini.
SUS vs. UEQ: Depth vs. Breadth
The User Experience Questionnaire (UEQ) expands on the SUS by measuring six dimensions: attractiveness, perspicuity (clarity), efficiency, dependability, stimulation, and novelty. While more detailed, the UEQ requires 26 items, making it longer to administer and analyze.
In contrast, the SUS provides a single, holistic usability score. This makes it faster to deploy, especially in early-stage testing or when working with limited participant pools. However, if you need granular insights into emotional or aesthetic aspects of UX, the UEQ may be more suitable.
That said, many teams use both: SUS for quick benchmarking and UEQ for deeper dives during later design phases.
SUS vs. Task Performance Metrics
Objective metrics like task success rate, time-on-task, and error counts are powerful indicators of usability. They show *what* users do, while SUS reveals *how they feel* about doing it.
For example, a user might complete a checkout process successfully (100% task success) but rate the system poorly on SUS due to perceived complexity or frustration. Conversely, a user might fail a task but still give a high SUS score if they found the interface intuitive and helpful.
system usability scale – System usability scale menjadi aspek penting yang dibahas di sini.
This divergence highlights the importance of combining subjective and objective data. The SUS complements behavioral metrics by capturing the user’s cognitive and emotional experience—something task logs alone can’t reveal.
“Numbers don’t lie, but they don’t tell the whole story. SUS fills the emotional gap.” — UX Researcher, Baymard Institute
SUS vs. Net Promoter Score (NPS)
NPS measures loyalty and willingness to recommend, often used in customer satisfaction surveys. While it can correlate with usability, it’s broader and less specific. A high NPS might reflect brand trust, price, or customer service—not necessarily usability.
The SUS, on the other hand, is laser-focused on usability. It doesn’t ask about likelihood to recommend; it asks about ease of use, complexity, and confidence. This specificity makes it more actionable for design teams trying to improve interface clarity.
That said, some organizations use both: SUS to diagnose usability issues and NPS to gauge overall customer satisfaction.
Step-by-Step Guide to Administering the System Usability Scale
Using the SUS effectively requires more than just handing out a questionnaire. Proper administration ensures reliable, valid results that can inform design decisions.
system usability scale – System usability scale menjadi aspek penting yang dibahas di sini.
Here’s a comprehensive guide to deploying the SUS in your next usability study.
When to Use the SUS in Your Research
The SUS is versatile and can be used at multiple stages of the design lifecycle:
- Early Prototypes: Even low-fidelity wireframes can be tested with SUS to catch usability issues before development.
- Mid-Design Iterations: Use SUS to compare different design versions (A/B testing) and track improvements.
- Post-Launch Evaluation: Measure real-world usability after a product goes live.
- Competitive Benchmarking: Test your product against competitors to identify strengths and weaknesses.
Best practice: Administer the SUS immediately after a usability task or session, while the experience is fresh in the user’s mind.
How to Select and Recruit Participants
While the SUS is reliable even with small sample sizes (as few as 5–8 users can detect major issues), larger samples improve statistical confidence.
For benchmarking, aim for at least 15–20 participants. Ensure they represent your target user group—consider demographics, technical proficiency, and prior experience with similar systems.
system usability scale – System usability scale menjadi aspek penting yang dibahas di sini.
Recruitment methods include:
- Existing customer panels
- Online recruitment platforms (e.g., UserTesting, Respondent)
- Social media or email campaigns
- In-person intercepts (for location-based services)
Always obtain informed consent and explain the purpose of the study to ensure honest, engaged responses.
Administering the Questionnaire: Best Practices
To maximize data quality, follow these guidelines:
- Use the exact wording: Even small changes can affect reliability. Stick to the standard 10-item format.
- Present items in order: Randomizing item order may reduce bias but breaks standardization.
- Use a consistent scale: Always use a 5-point Likert scale labeled clearly (e.g., 1 = Strongly Disagree, 5 = Strongly Agree).
- Keep it anonymous: Encourage honesty by not requiring names or identifiers.
- Combine with qualitative feedback: Add an open-ended question like, ‘What did you find most confusing?’ to enrich quantitative data.
You can administer the SUS via paper, email, or integrated into digital survey tools like Google Forms, Qualtrics, or SurveyMonkey.
“The SUS is only as good as its administration. Consistency is key.” — UX Consultant, Nielsen Norman Group
Interpreting System Usability Scale Scores: What Do the Numbers Mean?
Getting a SUS score is just the first step. The real value lies in interpreting what that number means for your product and users.
system usability scale – System usability scale menjadi aspek penting yang dibahas di sini.
Understanding score ranges, benchmarks, and trends over time turns raw data into actionable insights.
Understanding the SUS Score Range (0–100)
The SUS score ranges from 0 to 100, with no predefined ‘pass/fail’ threshold. However, research has established useful benchmarks:
- 0–67: Below average. Indicates significant usability issues.
- 68: The average SUS score across thousands of studies.
- 68–79: Above average. Generally acceptable, but room for improvement.
- 80–100: Excellent. Represents best-in-class usability (e.g., Google Search, iPhone interface).
For example, a score of 75 suggests your system is usable but could benefit from refinements. A score of 50 signals urgent usability problems that may impact user retention or satisfaction.
Industry Benchmarks and Comparative Analysis
Jeff Sauro and James Lewis, leading UX researchers, have compiled extensive benchmark data from over 5,000 SUS studies. Their findings show:
- Consumer software averages around 78.
- Enterprise software tends to score lower, around 65–70.
- Mobile apps vary widely, but top performers exceed 85.
- Websites average between 60 and 75, depending on complexity.
This data allows you to contextualize your score. If your e-commerce site scores 70, it’s slightly below average for websites—but if your internal HR system scores 70, that’s actually strong for enterprise software.
system usability scale – System usability scale menjadi aspek penting yang dibahas di sini.
Comparative testing—running SUS on your product and a competitor’s—can reveal competitive advantages or gaps. A 10-point difference is generally considered meaningful.
Tracking SUS Over Time: Measuring Improvement
One of the most powerful uses of the SUS is longitudinal tracking. By measuring SUS scores across design iterations, you can quantify the impact of UX improvements.
For example:
- Version 1.0: SUS = 62
- After redesign: SUS = 74
- Post-optimization: SUS = 81
This 19-point increase demonstrates clear usability progress. Teams can use such data to justify UX investments to stakeholders, showing tangible ROI on design work.
Tools like MeasuringU’s SUS Calculator automate scoring and provide instant benchmark comparisons.
system usability scale – System usability scale menjadi aspek penting yang dibahas di sini.
“A rising SUS trend is the best proof that your UX efforts are working.” — UX Lead, Spotify Design Team
Common Misuses and Pitfalls of the System Usability Scale
Despite its simplicity, the SUS is often misapplied, leading to misleading results. Avoiding these common mistakes ensures your data remains valid and useful.
Altering the Questionnaire Wording
One of the most frequent errors is rephrasing SUS items to ‘make them clearer’ or adapt to a specific context. While well-intentioned, this undermines the scale’s validity.
The SUS has been psychometrically validated in its original form. Changing even a single word (e.g., replacing ‘system’ with ‘app’) can alter how users interpret the item, making scores incomparable to benchmarks.
Solution: Use the exact wording. If context is unclear, provide a brief introduction (e.g., ‘Please rate your experience with the mobile app’).
Using SUS in Isolation
The SUS is a powerful metric, but it shouldn’t be used alone. Relying solely on SUS risks missing critical behavioral or emotional insights.
system usability scale – System usability scale menjadi aspek penting yang dibahas di sini.
For example, a user might give a high SUS score but struggle silently during tasks, or vice versa. Without observing behavior or collecting qualitative feedback, you won’t understand *why* the score is high or low.
Best practice: Combine SUS with:
- Task success rates
- Think-aloud protocols
- Post-test interviews
- System logs or heatmaps
This mixed-methods approach provides a complete picture of usability.
Ignoring Sample Size and Demographics
While SUS can be used with small samples, drawing broad conclusions from 5 users is risky. Small samples are prone to outliers and may not represent your full user base.
Additionally, mixing user groups (e.g., tech-savvy millennials and older adults with limited digital experience) can skew results. A score of 70 might reflect a wide variance in individual experiences.
system usability scale – System usability scale menjadi aspek penting yang dibahas di sini.
Solution: Segment your data by user type and ensure adequate sample size for statistical confidence. Use confidence intervals to express uncertainty in your results.
“SUS is simple, but simplicity doesn’t mean careless.” — Dr. Elizabeth Rosenzweig, UX Researcher
Advanced Applications of the System Usability Scale
Beyond basic usability testing, the SUS is being used in innovative ways across industries and research domains.
These advanced applications demonstrate its flexibility and enduring relevance in modern UX practice.
Using SUS in Healthcare and Medical Devices
In high-stakes environments like healthcare, usability isn’t just about convenience—it’s a matter of safety. The FDA and other regulatory bodies encourage the use of validated usability metrics in medical device evaluation.
The SUS is frequently used in human factors testing for devices like insulin pumps, ECG monitors, and electronic health record (EHR) systems. A low SUS score can signal potential use errors that might lead to patient harm.
system usability scale – System usability scale menjadi aspek penting yang dibahas di sini.
For example, a study published in Applied Ergonomics found that EHR systems with SUS scores below 60 were associated with higher clinician burnout and medication errors.
Because the SUS is quick to administer, it’s ideal for testing with busy healthcare professionals who may have limited time for research participation.
SUS in Educational Technology (EdTech)
With the rise of online learning platforms, the SUS has become a key tool for evaluating educational software. From LMS systems like Canvas to language apps like Duolingo, designers use SUS to ensure interfaces support learning rather than hinder it.
In EdTech, a high SUS score correlates with student engagement, completion rates, and perceived learning effectiveness. A study at MIT found that MOOCs with SUS scores above 75 had 30% higher course completion rates than those below 65.
Researchers also use SUS to compare different instructional designs—e.g., video-based vs. interactive modules—to determine which format users find more intuitive.
system usability scale – System usability scale menjadi aspek penting yang dibahas di sini.
Learn more about EdTech usability in this comprehensive study on digital learning tools.
Global and Cross-Cultural Use of SUS
The SUS has been translated into dozens of languages, including Chinese, Arabic, Russian, and Swahili. Its cross-cultural validity has been confirmed through multiple studies, making it a trusted tool in international UX research.
However, cultural differences in response styles (e.g., tendency to agree or avoid extremes) can affect scores. Some researchers apply cultural correction factors or use local norms for interpretation.
For example, users in Japan may give more conservative ratings, while those in Brazil may be more enthusiastic. Understanding these nuances ensures fair comparisons across regions.
“In a global market, SUS is the common language of usability.” — International UX Consortium
Future of the System Usability Scale: Evolution and Alternatives
As technology evolves, so do usability assessment methods. While the SUS remains dominant, new tools and adaptations are emerging to meet modern challenges.
system usability scale – System usability scale menjadi aspek penting yang dibahas di sini.
Modern Adaptations of SUS
Researchers have developed several SUS variants to address specific needs:
- mSUS (Mobile SUS): Slight wording adjustments for mobile apps, though studies show standard SUS works well.
- ASQ (After-Scenario Questionnaire): A 3-item mini-SUS used after individual tasks.
- CSUQ (Computer System Usability Questionnaire): More detailed, with subscales for interface quality, information quality, and support.
Despite these, the original SUS remains the most widely used due to its brevity and proven reliability.
Emerging Alternatives and Complementary Tools
New tools like the UX Metrics Framework and Experience Sampling Method (ESM) offer real-time, in-context feedback. Wearables and biometrics (e.g., eye-tracking, heart rate) are also being explored to measure cognitive load and frustration.
However, these methods are often expensive, complex, or intrusive. The SUS’s low cost and ease of use ensure it will remain a staple for years to come.
Will SUS Remain Relevant in the Age of AI?
With the rise of AI-driven interfaces—chatbots, voice assistants, generative UIs—the nature of usability is changing. Can a 1980s-era scale still measure the usability of a conversational agent?
system usability scale – System usability scale menjadi aspek penting yang dibahas di sini.
Preliminary research suggests yes. Studies on Alexa and Google Assistant have successfully used SUS to compare interaction models. However, new dimensions like ‘trust in AI’ or ‘perceived intelligence’ may require supplemental questions.
The core principles of usability—ease of use, learnability, efficiency—remain relevant, even as interfaces evolve. The SUS, with its focus on user perception, is well-positioned to adapt.
“The future of usability isn’t in abandoning old tools, but in evolving them.” — Dr. Jakob Nielsen
What is the System Usability Scale?
The System Usability Scale (SUS) is a 10-item questionnaire used to assess the perceived usability of a system. It provides a quick, reliable score from 0 to 100, with higher scores indicating better usability.
How do you calculate a SUS score?
system usability scale – System usability scale menjadi aspek penting yang dibahas di sini.
For each odd-numbered item, subtract 1 from the response. For even-numbered items, subtract the response from 5. Sum all adjusted scores and multiply by 2.5 to get the final SUS score.
What is a good SUS score?
A score above 68 is considered above average. Scores over 80 are excellent. However, ‘good’ depends on context—enterprise software may have lower benchmarks than consumer apps.
Can I modify the SUS questionnaire?
No. To maintain validity and comparability with benchmarks, use the exact wording and structure of the original SUS. Modifications can invalidate results.
system usability scale – System usability scale menjadi aspek penting yang dibahas di sini.
How many users do I need for a SUS study?
As few as 5–8 users can reveal major usability issues. For reliable benchmarking, aim for 15–20 or more participants.
The System Usability Scale remains one of the most trusted, efficient, and versatile tools in the UX researcher’s toolkit. From its origins in the 1980s to its current use in AI and global healthcare, the SUS has proven its enduring value. While newer tools emerge, the simplicity, reliability, and benchmarking power of the SUS ensure it will remain a cornerstone of usability evaluation. Whether you’re a designer, product manager, or researcher, mastering the SUS is essential for building products that are not just functional, but truly user-friendly.
system usability scale – System usability scale menjadi aspek penting yang dibahas di sini.
Recommended for you 👇
Further Reading: