Usability and UX Questionnaires

Please describe the different Usability and UX Questionnaires in terms of

Name of the questionnaire
Source / Weblink / Publication
Administration, Procedure (paper, online, time needed, post-hoc)
Structure (e.g. number of items, constructs)
Quality (e.g. Reliability, Validity, Intrusiveness etc.)

1 AttrakDiff
2 Post-Study Usability Questionnaire (PSSUQ)
3 Standardized User Experience Percentile Rank Questionnaire (SUPR-Q)
4 mHealth App Usability Questionnaire (MAUQ)
5 IsoNorm 9241/10
6 Isometrics
7 System Usability Scale (SUS)
8 User Experience Questionnaire (UEQ)
9 Software Usability Measurement Inventory (SUMI)
10 Usability Metric For User Experience (UMUX)
11 Usability Magnitude Estimation (UME)
12 Website Analysis and Measurement Inventory (WAMMI)

AttrakDiff

Duru & Anton

Source / Weblink / Publication

for general introduction

http://www.attrakdiff.de/index-en.html

to test the questionnaire

https://esurvey.uid.com/survey/#a845f8e5-ac38-471b-a1df-95aa4dd854d1

Administration, Procedure (paper, online, time needed, post-hoc): The test is performed online. There is no defined time to complete the questionnaire.

Goal: To give you an understanding of how users rate the usability and design of your interactive product. It provides you a broad overview of your project.

The link: https://www.kompetenzzentrum-usability.digital/kos/WNetz?art=File.download&id=1296&name=AttrakDiff_EN_UID.pdf

Structure (e.g. number of items, constructs): This questionnaire is made up of 28 items originally, distributed in 4 dimensions, each comprising 7 items. These 28 items in total are scored from -3 to 3.

Dimensions:

1. Pragmatic usability, which measures the usability of the product

2. Hedonic-stimulation usability which measures the stimulation generated by the system.

3. Hedonic-identification usability, which measures the identification of the user with the system.

4. Overall attractiveness, which measures the overall attractiveness of the system, based on pragmatic and hedonic qualities.

Although known for its original version of 28 items, there is also a shorter version of 10 items, to reduce the completion time.

Quality (e.g. Reliability, Validity, Intrusiveness etc.)

The AttrakDiff is an empirically validated user experience questionnaire that measures both pragmatic and hedonic quality (Hassenzahl et al., 2003).

Reliability Cronbachs α Scores of the subscales are: HQ-S α = ,76 - ,90 HQ-I α = ,73 - ,83 PQ α= ,83 - ,85

The reference article: https://dl.gi.de/server/api/core/bitstreams/5a6da40f-5018-49e2-b0fd-58e9f49dad71/content

Pros

You can isolate one of the four subscales, if you want to.

The test is free of charge.

Two language options: German and English.

Graphs are provided with the analysis, which makes it easier to comprehend.

Cons

It is recommended to not to use it for wireframe prototypes. *

Post-Study Usability Questionnaire (PSSUQ)

Theresa & Moritz

Source / Weblink / Publication

https://cdn.uiuxtrend.com/wp-content/uploads/PSSUQ-Questionnaire-PDF-Template.pdf

Administration, Procedure (paper, online, time needed, post-hoc)

Structure (e.g. number of items, constructs)

Quality (e.g. Reliability, Validity, Intrusiveness etc.)

Pros

Cons

Standardized User Experience Percentile Rank Questionnaire (SUPR-Q)

Liubou & Helena

Source / Weblink / Publication: https://measuringu.com/product/suprq/

Administration, Procedure (paper, online, time needed, post-hoc)

Structure (e.g. number of items, constructs)

Quality (e.g. Reliability, Validity, Intrusiveness etc.)

Pros

Cons

mHealth App Usability Questionnaire (MAUQ)

Source / Weblink / Publication: https://pubmed.ncbi.nlm.nih.gov/30973342/ (Description and Abstract only); https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6482399/ (Full text)

Administration, Procedure (paper, online, time needed, post-hoc)

Structure (e.g. number of items, constructs)

Quality (e.g. Reliability, Validity, Intrusiveness etc.)

Pros

Cons

IsoNorm 9241/10

Source / Weblink / Publication

Paper from Jochen Prümper, the author of the questionnaire (in German)

https://www.researchgate.net/publication/290368471_Der_Benutzungsfragebogen_ISONORM_924110_Ergebnisse_zur_Reliabilitat_und_Validitat

Downlaod German Version

https://projekt.kke.tu-berlin.de/wp-content/uploads/2015/09/Methode_Isonorm-Fragebogen.pdf

Paper from Kathrin Figl on IsoNorm and Isometrics, comparing the two (in German)

https://dl.gi.de/bitstream/handle/20.500.12116/6721/Figl_2009.pdf?sequence=2

Administration, Procedure (paper, online, time needed, post-hoc)

Structure (e.g. number of items, constructs)

Quality (e.g. Reliability, Validity, Intrusiveness etc.)

Pros

Cons

Isometrics

Source / Weblink / Publication: Gediga, Günther & Hamborg, Kai-Christoph & Duntsch,. (1999). The IsoMetrics Usability Inventory. An operationalisation of ISO 9241-10 supporting summative and formative evaluation of software systems. Behaviour and Information Technology. 18. 151-164. https://doi.org/10.1080/014492999119057

Administration, Procedure (paper, online, time needed, post-hoc)

The Isometrics test is performed online

The mean time to perform this test is 1.5 hours (long version)
The mean time to perform this test is 30 minutes (short version)

Structure (e.g. number of items, constructs)

The number of items is 75 with an additional weighting and open answers with examples (long version)
The questionnaire consisted of 75 items based on the seven principles of suitability for the task, self-descriptiveness, controllability, conformity with user expectations, error tolerance, suitability for individualization, and suitability for learning.
The number of items is 35 by just using Likert-like questions (short version)
IsoMetrics contains a five point rating for each of the items starting from one (`predominantly disagree’) to (`predominantly agree’) including (`no opinion’) 
Seven scores of the usability dimension to measure the progress of development
Concrete information about malfunctions and their user-perceived attributes
Mean weight of any user-perceived attribute given a class of system malfunctions.

Quality (e.g. Reliability, Validity, Intrusiveness etc.)

It is deemed a valuable and reliable tool
Inter-rater reliability turns out to be very high (Cohen’s kap- pa = 0.95)
High reliability of sub-scales

System Usability Scale (SUS)

Source / Weblink / Publication

Brooke, J., 1996. SUS: A ‘quick and dirty’ usability scale. In: Jordan, P., Thomas, B., Weerdmeester, B. (Eds.), Usability Evaluation in Industry. Taylor & Francis, London, UK, pp. 189–194.

Administration, Procedure (paper, online, time needed, post-hoc)

User fills in the SUS directly after using the app/platform prior to possible discussions
can be done online and paper
free to use, just needs to be mentioned

Structure (e.g. number of items, constructs)

10 items, the odd-numbered items have a positive tone; the even-numbered items a negative tone
rating from 1 to 5 (disagree - strongly agrees)
SUS scores range from 0 to 100 in 2.5 point increments
calculation instructions (excel sheet): http://www.measuringux.com/SUS_Calculation.xls

Quality (e.g. Reliability, Validity, Intrusiveness etc.)

reliability > 0.90
sensitive to different types of devices or systems
significant relationship between SUS scores and a composite metric based on business indicators of success in the marketplace

Potential Disadvantages

misinterpretation
mistake: users do not agree or disagree in the sense they want to because of changing tone
miscode: researchers do calculation error

User Experience Questionnaire (UEQ)

The scales of the questionnaire cover a comprehensive impression of user experience. Both classical usability aspects (efficiency, perspicuity, dependability) and user experience aspects (originality, stimulation) are measured.

Source / Weblink / Publication

You can download the questionnaire for free on this website: https://www.ueq-online.org/
published by the UEQ Team

Administration, Procedure (paper, online, time needed, post-hoc)

questionnaire to evaluate a product based on contrasting attributes on a 7-point-scale
accessible as a PDF download or as an online questionnaire
either filled out online or on paper
Full version and short version available
Time needed for full version approx. 5-10 minutes

Structure (e.g. number of items, constructs)

The UEQ Full Version contains 6 scales with 26 items (https://www.ueq-online.org/Material/Handbook.pdf):

Attractiveness: Overall impression of the product. Do users like or dislike the product?
Perspicuity: Is it easy to get familiar with the product? Is it easy to learn how to use the product?
Efficiency: Can users solve their tasks without unnecessary effort?
Dependability: Does the user feel in control of the interaction?
Stimulation: Is it exciting and motivating to use the product?
Novelty: Is the product innovative and creative? Does the product catch the interest of users?

CopyRight https://www.ueq-online.org/Material/Handbook.pdf

Short version: 8 items 3-5 Minutes

Quality (e.g. Reliability, Validity, Intrusiveness etc.)

gives idea about the perception of different characteristics of a product
good construct validity, measured with Cronbach's Alpha
not intrusive

Software Usability Measurement Inventory (SUMI)

Source: / Weblink / Publication: http://sumi.uxp.ie/en/

Description: ISO/IEC 9126 tries to develop a common understanding of the project's objectives and goals. SUMI is supported by an extensive reference database and an analysis and report generation tool called SUMISCO. The standard is divided into four parts: quality model external metrics internal metrics quality in use metrics

Administration and Procedure:

Online SUMI requires sample sizes with a minimum of about 20 unless your respondents are well selected. Once the questionnaire is filled, submit results to online and a report would be sent. Time : 15 - 20 mins

Structure :

SUMI consists of 50 statements to which the user has to reply that they either Agree, Don't Know, or Disagree. 20 languages The Overall Assessment down into 5 sub-scales: 1.Affect: how much the product captures the user's emotional responses. 2.Efficiency: degree to which the user can achieve the goals of his interaction with the product in a direct and timely manner. 3.Helpfulness; extent to which the product seems to assist the user 4.Control: degree to which the user feels he, and not the product, is setting the pace. 5.Learnability: ease with which a user can get started and learn new features of the product

Quality:

Validity & Reliability: SUMI enables measurement of some of the user-orientated requirements expressed in the European Directive on Minimum Health and Safety Requirements for Work with Display Screen Equipment(90/270/EEC).. Three different kinds of validity studies have been conducted with SUMI: I)the industrial partners within the MUSiC consortium used SUMI as part of the industry-scale validation of the MUSiC usability evaluation toolset. II)there are now a number of laboratory-based studies which have been carried out in the Human Factors Research Group. III)studies which have been carried out for industrial clients on a consultancy basis. consultancy studies are nearly always commissioned on the understanding of strict confidentiality agreements and are not disclosable in public except in vague outline

Example of the report :

Usability Metric For User Experience (UMUX)

Group 2 is doing this :) The UMUX questionnaire was intended to offer a shorter alternative to the 10-item SUS questionnaire. Whereas SUS assesses perceived usability and learnability, UMUX targets usability by assessing effectiveness, efficiency, and satisfaction.

Source / Weblink / Publication: There is not really a direct source that creates this for you, but there are plenty of templates that help you run it.; Here is a good overview of the whole process: https://help.qualaroo.com/hc/en-us/articles/360039072752-UMUX-Usability-Metric-for-User-Experience-

Administration, Procedure: Conducting on paper or online

Structure (e.g. number of items, constructs)

UMUX is a simple four-item questionnaire listing two positive and two negative statements that respondents rate agreement with on a five or seven-point Likert scale. UMUX-Lite is the shorter version which only contains the positive statements.

The four items are:

[This system’s] capabilities meet my requirements.
Using [this system] is a frustrating experience.
[This system] is easy to use.
I have to spend too much time correcting things with [this system].

Scoring

Odd items are scored as [user score - 1]. Even items are scored as [7 - user score].
Add up these differences and divide the sum by 24 (the highest possible score).
Multiply your quotient by 100. Average your results across users.
If you’re using a 5-point Likert scale, you will only have to make a few adjustments to the steps above. Even items should be scored as [5 - user score] and you’ll have to divide the sum by 16 instead of 24.

Quality (e.g. Reliability, Validity, Intrusiveness etc.)

Reliability: Sauro and Lewis (2016) estimated UMUX reliability to be 0.87 and 0.81, and the correlations with the SUS were 0.90 (standard) and 0.79 (positive).

Although scoring might be slightly more complicated with this specific template, the results will ensure that measuring of usability according to its standardized definition and with very little time spent on set-up!

Because the UMUX Questionnaire is evaluated on a Likert scale, we recommend keeping potential respondent biases in mind that are associated with Likert scale reporting.

Social desirability bias - Most people tend to give answers that they consider as socially acceptable rather than being fully honest.
- If your results are overwhelmingly positive, dig in further by reaching out to some of your respondents for in-depth user research interviews.
Central tendency bias - Most people avoid choosing extreme responses such as 1 “Strongly disagree” or 7 “Strongly agree”
- While the UMUX questionnaire was tested and retested to help mitigate this bias, if you are finding that the majority of your responses are neutral, you might consider how the central tendency bias is at work here.

Pros

The UMUX Questionnaire is an ideal length. Shorter than the System Usability Scale. Longer than the Single Ease Questionnaire. UMUX gives you a greater amount of insights that your users can provide quickly!
UMUX evaluates usability based on the official definition and standard established by the ISO.

Cons

Scoring is somewhat more complex when it comes to UMUX, especially if you’re interested in comparing your results to an SUS score.
Because UMUX combines both positive and negative questions, some researchers have found that it creates a two-factor test. However, UMUX-Lite can be used instead, if you find your data showing the same concern.

Usability Magnitude Estimation (UME)

https://journals.sagepub.com/doi/pdf/10.1177/154193120404800536

Website Analysis and Measurement Inventory (WAMMI)

Goal

WAMMI measures user-satisfaction by asking visitors to your website to compare their expectations with what they actually experience on the website

Source / Weblink / Publication

http://www.wammi.com/whatis.html

Administration, Procedure

Online Survey/Questionnaire

Structure (e.g. number of items, constructs)

- Standarised 20-statement questionnaire and a unique international database.

- You customize your survey by asking additional questions of your choice around the 20 statements.

- It is structured into five key scales: Attractiveness, Controllability, Efficiency, Helpfulness and Learnabilit

Reliability

Reliability data rating of between 0.90 and 0.93 . WAMMI is a scientific analytics service, iteratively developed using Psychometric techniques. It has been scientifically proven and has a reliability data rating of between 0.90 and 0.93." " It's measured in terms of five key scales: Attractiveness, Controllability, Efficiency, Helpfulness and Learnability. The Global Usability Score (GUS) exhibits a high degree of reliability, technically measured using Cronbach's Alpha coefficient, with Alpha = 0.90. This coefficient usually ranges from 0.00 (poor reliability) to 1.00 (high reliability).

The five sub-scales, being smaller, have lower intrinsic Alpha coefficients, but when adjusted for size, are comparable with the Global scale:

Scoring

5-Point Likert Scale from 1 (Strongly Agree) to 5 (Strongly Disagree)

Report

Your site is also given an overall Global Usability Score (GUS) for how well visitors rate your site. An average score is 50 and a perfect score is 100. The WAMMI report also contains: - An individual statement analysis, giving you more information about which aspects of your website need improving. - Cross-tabulations of all additional questions (fixed choice answers) with the GUS and the five WAMMI scales. Cross tabulations of visitor responses to check-box questions with the GUS and the five WAMMI scales. - Unedited responses to free-text questions where visitors comment about things not specifically asked for by the WAMMI statements. - Individual visitor profiles and a numeric summary of the WAMMI results.

Pros

WAMMI is currently available in the following languages: Danish, Dutch, English, Finnish, French, German, Italian, Norwegian, Polish, Portuguese (European), Spanish and Swedish.

Cons

You need to include the entire set of sub-scales into your testing for a high reliability.

Usability and UX Questionnaires

Contents

AttrakDiff

Post-Study Usability Questionnaire (PSSUQ)

Standardized User Experience Percentile Rank Questionnaire (SUPR-Q)

mHealth App Usability Questionnaire (MAUQ)

IsoNorm 9241/10

Isometrics

System Usability Scale (SUS)

User Experience Questionnaire (UEQ)

Software Usability Measurement Inventory (SUMI)

Usability Metric For User Experience (UMUX)

Usability Magnitude Estimation (UME)

Website Analysis and Measurement Inventory (WAMMI)

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools