What is the relationship between reliability and validity?
- FLAD
- Foreign Language Assessment Directory
- Understanding Assessment Tutorial
- Introduction
- Validity
- What do I want to know?
- What skills do I want to measure?
- What is the intended purpose of the test?
- How will I use the test results?
- What information will the test provide?
- Show what you know!
- Puzzle Piece
- Reliability
- What is the relationship between reliability and validity?
- How do I determine if a test is reliable for my situation?
- What could affect reliability?
- Show what you know!
- Puzzle Piece
- Practicality
- Do I have the resources to use this test in my classroom?
- What are the practical considerations for test administration?
- What are the practical considerations in scoring a test?
- Show what you know!
- Puzzle Piece
- Impact
- What are the possible effects of a test?
- What does positive washback look like?
- What does negative washback looks like?
- Who will be affected?
- How will different stakeholders be affected?
- Show what you know!
- Puzzle Piece
- Putting It All Together
- Needs Assessment
- Resources
- Heritage Language Assessment Module
- Introduction
- Linguistic Characteristics and Considerations
- Cultural Characteristics and Considerations
- Factors in Language Development
- Program Types
- Implications for Assessment
- Show What You Know!
- Assessing HLLs: The Why
- Assessing HLLs: The What
- Placement Tests
- Formative Assessment
- Summative Assessment
- Examples of Effective Assessment Tasks
- Summary of Best Practices
- Show What You Know!
- Assessing HLLs: The How
- Needs Assessment
- Selecting Assessments
- Modifying Assessments
- Developing Assessments
- Show What You Know!
- Putting It All Together
- Resources
- Introduction
- Post-Secondary World Language Assessment Module
- Introduction
- Proficiency
- Acquiring Proficiency
- Proficiency Levels
- Proficiency-Based Approach to Assessment: The What
- Proficiency-Based Approach to Assessment: The Why
- Proficiency-Based Approach to Assessment: The How
- Types of Assessments
- Summary of Best Practices
- Show What You Know!
- Placement Testing
- Placement Testing: The Why
- Placement Testing: The How
- Types of Assessment Tools and Approaches for Placement
- Selecting Placement Tests
- Additional Considerations
- Using Placement Test Results
- Summary of Best Practices
- Show What You Know!
- Assessment Plans
- Assessment Plans: The Why
- Assessment Plans: The How
- Aligning Assessment with Instruction
- Performance-based Assessment Tasks
- Designing Performance-based Assessment Tasks
- Scoring Performance-based Assessment Tasks
- Using Integrated Performance Assessments
- Designing Integrated Performance Assessments
- Intercultural Communicative Competence
- Assessing Intercultural Communication
- Assessing Cultures
- Assessment and Program Articulation
- Summary of Best Practices
- Show What You Know!
- Putting It All Together
- Resources
Reliability and validity are closely related. To better understand this relationship, letโs step out of the world of testing and onto a bathroom scale.
If the scale is reliable it tells you the same weight every time you step on it as long as your weight has not actually changed. However, if the scale is not working properly, this number may not be your actual weight. If that is the case, this is an example of a scale that is reliable, or consistent, but not valid. For the scale to be valid and reliable, not only does it need to tell you the same weight every time you step on the scale, but it also has to measure your actual weight.
Switching back to testing, the situation is essentially the same. A test can be reliable, meaning that the test-takers will get the same score no matter when or where they take it, within reasonably analogous circumstances. But that doesnโt mean that it is valid, or measuring what it is supposed to measure. A test can be reliable without being valid. However, a test cannot be valid unless it is reliable.
Another way to think of it is that a test can give a consistent, poor result. However, it cannot give a good result unless it is consistent.