LoGiudice, Andrew B.; Norman, Geoffrey R.; Manzoor, Saba; Monteiro, Sandra – Advances in Health Sciences Education, 2023
Students are often encouraged to learn 'deeply' by abstracting generalizable principles from course content rather than memorizing details. So widespread is this perspective that Likert-style inventories are now routinely administered to students to quantify how much a given course or curriculum evokes deep learning. The predictive validity of…
Descriptors: Learning Processes, Transfer of Training, Likert Scales, Generalization
Deng, Jacky M.; Streja, Nicholas; Flynn, Alison B. – Journal of Chemical Education, 2021
Response process validity evidence can provide researchers with insight into how and why participants interpret items on instruments such as tests and questionnaires. In chemistry education research literature and the social sciences more broadly, response process validity evidence has been used and reported in a variety of ways. This paper's…
Descriptors: Chemistry, Science Education, Educational Research, Validity
Ö. Emre C. Alagöz; Thorsten Meiser – Educational and Psychological Measurement, 2024
To improve the validity of self-report measures, researchers should control for response style (RS) effects, which can be achieved with IRTree models. A traditional IRTree model considers a response as a combination of distinct decision-making processes, where the substantive trait affects the decision on response direction, while decisions about…
Descriptors: Item Response Theory, Validity, Self Evaluation (Individuals), Decision Making
Lewis, Jennifer; Lim, Hwanggyu; Padellaro, Frank; Sireci, Stephen G.; Zenisky, April L. – Educational Measurement: Issues and Practice, 2022
Setting cut scores on multistage adaptive tests (MSTs) is difficult, particularly when the test spans several grade levels, and the selection of items from MST panels must reflect the operational test specifications. In this study, we describe, illustrate, and evaluate three methods for mapping panelists' Angoff ratings into cut scores on the scale underlying an MST. The…
Descriptors: Cutting Scores, Adaptive Testing, Test Items, Item Analysis
Marc Brysbaert – Cognitive Research: Principles and Implications, 2024
Experimental psychology is witnessing an increase in research on individual differences, which requires the development of new tasks that can reliably assess variations among participants. To do this, cognitive researchers need statistical methods that many researchers have not learned during their training. The lack of expertise can pose…
Descriptors: Experimental Psychology, Individual Differences, Statistical Analysis, Task Analysis
Jean-Yves Bégin; Luc Touchette; Caroline Couture; Cassandre Blais – International Journal of Nurture in Education, 2020
The Boxall Profile provides a framework for the structured observation of children in nurture groups. It is a detailed and rigorously trialled normative diagnostic instrument developed for teachers and teaching assistants to measure children's levels of emotional and behavioural functioning. Moreover, it highlights specific targets for…
Descriptors: Psychometrics, French, Observation, Children
Svenja Woitt; Joshua Weidlich; Ioana Jivet; Derya Orhan Göksün; Hendrik Drachsler; Marco Kalz – Teaching in Higher Education, 2025
Given the crucial role of feedback in supporting learning in higher education, understanding the factors influencing feedback effectiveness is imperative. Student feedback literacy, that is, the set of attitudes and abilities to make sense of and utilize feedback, is therefore considered a key concept. Rigorous investigations of feedback literacy…
Descriptors: Feedback (Response), Higher Education, Multiple Literacies, Teacher Effectiveness
Nájera, Pablo; Sorrel, Miguel A.; Abad, Francisco José – Educational and Psychological Measurement, 2019
Cognitive diagnosis models (CDMs) are latent class multidimensional statistical models that help classify people accurately by using a set of discrete latent variables, commonly referred to as attributes. These models require a Q-matrix that indicates the attributes involved in each item. A potential problem is that the Q-matrix construction…
Descriptors: Matrices, Statistical Analysis, Models, Classification
Gill, Tim – Research Matters, 2022
In Comparative Judgement (CJ) exercises, examiners are asked to look at a selection of candidate scripts (with marks removed) and order them in terms of which they believe display the best quality. By including scripts from different examination sessions, the results of these exercises can be used to help with maintaining standards. Results from…
Descriptors: Comparative Analysis, Decision Making, Scripts, Standards
Kevser Arslan; Asli Görgülü Ari – Shanlax International Journal of Education, 2024
This study aimed to develop a valid and reliable multiple-choice achievement test for the subject area of ecology. The study was conducted within the framework of exploratory sequential design based on mixed research methods, and the study group consisted of a total of 250 middle school students studying at the sixth and seventh grade level. In…
Descriptors: Ecology, Science Tests, Test Construction, Multiple Choice Tests
Karen Leary Duseau – North American Chapter of the International Group for the Psychology of Mathematics Education, 2023
Assessment is a topic of concern to all stakeholders in our educational system. Pattern Based Questions are an assessment tool that offers an alternative to standardized assessment tools. They are based on generative learning pedagogy, which shows promise in engaging all learners and usefulness in teaching and learning, but validity has not yet…
Descriptors: Undergraduate Students, College Mathematics, Mathematics Skills, Thinking Skills
Mo, Ya; Carney, Michele; Cavey, Laurie; Totorica, Tatia – Applied Measurement in Education, 2021
There is a need for assessment items that assess complex constructs but can also be efficiently scored for evaluation of teacher education programs. In an effort to measure the construct of teacher attentiveness in an efficient and scalable manner, we are using exemplar responses elicited by constructed-response item prompts to develop…
Descriptors: Protocol Analysis, Test Items, Responses, Mathematics Teachers
Shan Lin; Jian Wang – Journal of Baltic Science Education, 2024
Scientific thinking constitutes a vital component of scientific competencies, crucial for citizens to adapt to the evolving societal landscape. To cultivate students' scientific thinking, teachers should possess an adequate professional knowledge foundation, which encompasses pedagogical content knowledge (PCK). Assessing teachers' PCK of…
Descriptors: Secondary School Teachers, Teacher Attitudes, Biology, Pedagogical Content Knowledge
Höhne, Jan Karem; Yan, Ting – International Journal of Social Research Methodology, 2020
Web surveys are an established data collection mode that use written language to provide information. The written language is accompanied by visual elements, such as presentation formats and shapes. However, research has shown that visual elements influence response behavior because respondents sometimes use interpretive heuristics to make sense…
Descriptors: Heuristics, Visual Aids, Online Surveys, Response Style (Tests)
Koziol, Natalie A.; Goodrich, J. Marc; Yoon, HyeonJin – Educational and Psychological Measurement, 2022
Differential item functioning (DIF) is often used to examine validity evidence of alternate form test accommodations. Unfortunately, traditional approaches for evaluating DIF are prone to selection bias. This article proposes a novel DIF framework that capitalizes on regression discontinuity design analysis to control for selection bias. A…
Descriptors: Regression (Statistics), Item Analysis, Validity, Testing Accommodations