Publication Date
| In 2026 | 0 |
| Since 2025 | 197 |
| Since 2022 (last 5 years) | 1067 |
| Since 2017 (last 10 years) | 2577 |
| Since 2007 (last 20 years) | 4938 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 225 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 65 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Guo, Wenjing; Choi, Youn-Jeng – Educational and Psychological Measurement, 2023
Determining the number of dimensions is extremely important in applying item response theory (IRT) models to data. Traditional and revised parallel analyses have been proposed within the factor analysis framework, and both have shown some promise in assessing dimensionality. However, their performance in the IRT framework has not been…
Descriptors: Item Response Theory, Evaluation Methods, Factor Analysis, Guidelines
Hsiao, Kuo-Lun; Ku, Ya-Yuan; Lee, Ya-Ting – Education and Information Technologies, 2023
New media literacy is an expected competency for university students. However, few literacy scales can evaluate students' fake news reporting and checking abilities. In the past, the new media literacy framework only included Critical Consuming, Critical Prosumption, Functional Prosumption, and Functional Consuming. Therefore, this study proposes…
Descriptors: Test Construction, Media Literacy, Test Validity, Test Items
Novak, Josip; Rebernjak, Blaž – Measurement: Interdisciplinary Research and Perspectives, 2023
A Monte Carlo simulation study was conducted to examine the performance of [alpha], [lambda]2, [lambda][subscript 4], [lambda][subscript 2], [omega][subscript T], GLB[subscript MRFA], and GLB[subscript Algebraic] coefficients. Population reliability, distribution shape, sample size, test length, and number of response categories were varied…
Descriptors: Monte Carlo Methods, Evaluation Methods, Reliability, Simulation
Braun, Thorsten; Stierle, Rolf; Fischer, Matthias; Gross, Joachim – Chemical Engineering Education, 2023
Contributing to a competency model for engineering thermodynamics, we investigate the empirical competency structure of our exams in an attempt to answer the question: Do we test the competencies we want to convey to our students? We demonstrate that thermodynamic modeling and mathematical solution emerge as significant dimensions of thermodynamic…
Descriptors: Thermodynamics, Consciousness Raising, Engineering Education, Test Format
Stenger, Rachel; Olson, Kristen; Smyth, Jolene D. – Field Methods, 2023
Questionnaire designers use readability measures to ensure that questions can be understood by the target population. The most common measure is the Flesch-Kincaid Grade level, but other formulas exist. This article compares six different readability measures across 150 questions in a self-administered questionnaire, finding notable variation in…
Descriptors: Readability, Readability Formulas, Computer Assisted Testing, Evaluation Methods
Vitello, Sylvia; Crisp, Victoria; Ireland, Jo – Research Matters, 2023
Assessment materials must be checked for errors before they are presented to candidates. Any errors have the potential to reduce validity. For example, in the most extreme cases, an error may turn an otherwise well-designed exam question into one that is impossible to answer. In Cambridge University Press & Assessment, assessment materials are…
Descriptors: Check Lists, Test Validity, Error Correction, Test Construction
Sarah Wellberg – Research in Mathematics Education, 2023
High-school mathematics teachers tend to use computational, constructed response questions in their classroom tests. However, the rapid shift to distance learning resulting from the COVID-19 pandemic created technological obstacles to using these items. This study investigated teachers' reasons for using particular items and how they adapted their…
Descriptors: High School Teachers, Mathematics Teachers, Computation, Test Items
Lim, Hwanggyu; Choe, Edison M. – Journal of Educational Measurement, 2023
The residual differential item functioning (RDIF) detection framework was developed recently under a linear testing context. To explore the potential application of this framework to computerized adaptive testing (CAT), the present study investigated the utility of the RDIF[subscript R] statistic both as an index for detecting uniform DIF of…
Descriptors: Test Items, Computer Assisted Testing, Item Response Theory, Adaptive Testing
Falcão, Filipe; Pereira, Daniela Marques; Gonçalves, Nuno; De Champlain, Andre; Costa, Patrício; Pêgo, José Miguel – Advances in Health Sciences Education, 2023
Automatic Item Generation (AIG) refers to the process of using cognitive models to generate test items using computer modules. It is a new but rapidly evolving research area where cognitive and psychometric theory are combined into digital framework. However, assessment of the item quality, usability and validity of AIG relative to traditional…
Descriptors: Computer Assisted Testing, Test Construction, Test Items, Automation
Federica Ferretti; Alessandro Gambini; Camilla Spagnolo – European Journal of Science and Mathematics Education, 2024
As highlighted in the literature, one of the main difficulties in mathematics is the management of different semiotic representations. This difficulty occurs in verticals throughout schooling and is often an obstacle to the proper learning process of mathematics. The present study aims to investigate the different facets of these difficulties with…
Descriptors: Semiotics, Mathematics Education, Mathematics Tests, Test Items
Lei Guo; Wenjie Zhou; Xiao Li – Journal of Educational and Behavioral Statistics, 2024
The testlet design is very popular in educational and psychological assessments. This article proposes a new cognitive diagnosis model, the multiple-choice cognitive diagnostic testlet (MC-CDT) model for tests using testlets consisting of MC items. The MC-CDT model uses the original examinees' responses to MC items instead of dichotomously scored…
Descriptors: Multiple Choice Tests, Diagnostic Tests, Accuracy, Computer Software
Janhavi Mallaiah; Olajide Williams; John P. Allegrante – Health Education & Behavior, 2024
Community health workers (CHWs) are increasingly being required to perform complex health care activities, especially in community cardiovascular disease and stroke prevention. However, currently, there are no psychometrically validated instruments for assessing CHW competencies in these roles. This article describes the development and validation…
Descriptors: Community Health Services, Health Personnel, Test Construction, Test Validity
Sakinah Idris; Femke H. F. ten Hoeve; Allison B. Ratto; Susan W. White; Neeltje van Haren; Kirstin Greaves-Lord – Journal of Autism and Developmental Disorders, 2024
The goal of this study was to translate and adapt the original 9-item of the Contextual Assessment of Social Skills (CASS) to a Dutch version and assess its psychometric qualities. Autistic adolescents aged 12 to 18 years (n = 99) took part in a randomized controlled trial. In this study, pre-intervention data were utilized. The original CASS was…
Descriptors: Psychometrics, Interpersonal Competence, Autism Spectrum Disorders, Adolescents
Nicole ter Wal; Caroline B. Terwee; Johanna M. A. Visser-Meily; Eline Alons; Lotti Dijkhuis; Ellen Gerrits; Lizet van Ewijk – International Journal of Language & Communication Disorders, 2024
Background: People with communication problems experience challenges in participation. Optimizing communicative participation for this population is an important outcome of speech and language therapy. Participation experiences are best assessed from the patient's perspective, using a patient-reported outcome measure (PROM). The Communicative…
Descriptors: Test Construction, Test Validity, Adults, Self Evaluation (Individuals)
Ö. Emre C. Alagöz; Thorsten Meiser – Educational and Psychological Measurement, 2024
To improve the validity of self-report measures, researchers should control for response style (RS) effects, which can be achieved with IRTree models. A traditional IRTree model considers a response as a combination of distinct decision-making processes, where the substantive trait affects the decision on response direction, while decisions about…
Descriptors: Item Response Theory, Validity, Self Evaluation (Individuals), Decision Making

Peer reviewed
Direct link
