Publication Date
In 2025 | 2 |
Since 2024 | 3 |
Since 2021 (last 5 years) | 7 |
Since 2016 (last 10 years) | 18 |
Since 2006 (last 20 years) | 31 |
Descriptor
Inferences | 35 |
Test Reliability | 35 |
Test Validity | 33 |
Test Construction | 12 |
Multiple Choice Tests | 8 |
Foreign Countries | 7 |
Scores | 7 |
Reading Comprehension | 6 |
Statistical Analysis | 6 |
Test Items | 6 |
Test Theory | 6 |
More ▼ |
Source
Author
Biancarosa, Gina | 2 |
Carlson, Sarah E. | 2 |
Crawford, Angela | 2 |
Davison, Mark L. | 2 |
Johnson, Evelyn S. | 2 |
Liu, Bowen | 2 |
Mislevy, Robert J. | 2 |
Moylan, Laura A. | 2 |
Seipel, Ben | 2 |
Zheng, Yuzhu | 2 |
Alexiou, Jon J. | 1 |
More ▼ |
Publication Type
Journal Articles | 26 |
Reports - Research | 20 |
Reports - Evaluative | 9 |
Reports - Descriptive | 4 |
Tests/Questionnaires | 4 |
Information Analyses | 3 |
Dissertations/Theses -… | 2 |
Education Level
Higher Education | 8 |
Elementary Education | 7 |
Postsecondary Education | 6 |
Secondary Education | 6 |
Elementary Secondary Education | 5 |
High Schools | 3 |
Early Childhood Education | 1 |
Grade 12 | 1 |
Grade 2 | 1 |
Grade 3 | 1 |
Grade 4 | 1 |
More ▼ |
Audience
Practitioners | 1 |
Teachers | 1 |
Location
Florida | 2 |
Idaho | 2 |
Malaysia | 2 |
Wisconsin | 2 |
California | 1 |
France | 1 |
Iran | 1 |
Mexico | 1 |
Nigeria | 1 |
Ohio | 1 |
United Kingdom (England) | 1 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
National Assessment of… | 1 |
What Works Clearinghouse Rating
Benjamin R. Shear; Derek C. Briggs – Asia Pacific Education Review, 2024
Research in the social and behavioral sciences relies on a wide range of experimental and quasi-experimental designs to estimate the causal effects of specific programs, policies, and events. In this paper we highlight measurement issues relevant to evaluating the validity of causal estimation and generalization. These issues impact all four…
Descriptors: Measurement Techniques, Inferences, COVID-19, Pandemics
Paul T. von Hippel; Brendan A. Schuetze – Annenberg Institute for School Reform at Brown University, 2025
Researchers across many fields have called for greater attention to heterogeneity of treatment effects--shifting focus from the average effect to variation in effects between different treatments, studies, or subgroups. True heterogeneity is important, but many reports of heterogeneity have proved to be false, non-replicable, or exaggerated. In…
Descriptors: Educational Research, Replication (Evaluation), Generalizability Theory, Inferences
Nicolas Rochat; Laurent Lima; Pascal Bressoux – Journal of Psychoeducational Assessment, 2025
Inference is considered an important factor in comprehension models and has been described as a causal factor in predicting comprehension. To date, specific tests for inference are rare and often rely on specific thematic texts. This reliance on thematic inference may raise some concerns as inference is related to prior text-specific knowledge.…
Descriptors: Inferences, Reading Comprehension, Reading Tests, Test Reliability
Maddox, Bryan – OECD Publishing, 2023
The digital transition in educational testing has introduced many new opportunities for technology to enhance large-scale assessments. These include the potential to collect and use log data on test-taker response processes routinely, and on a large scale. Process data has long been recognised as a valuable source of validation evidence in…
Descriptors: Measurement, Inferences, Test Reliability, Computer Assisted Testing
Rodríguez-Vásquez, Flor Monserrat; Ariza-Hernandez, Francisco J. – EURASIA Journal of Mathematics, Science and Technology Education, 2021
The evaluation of learning in mathematics is a worldwide problem, therefore, new methods are required to assess the understanding of mathematical concepts. In this paper, we propose to use the Item Response Theory to analyze the understanding level of undergraduate students about the real function mathematical concept. The Bayesian approach was…
Descriptors: Bayesian Statistics, Mathematics Education, Item Response Theory, Undergraduate Students
Gotch, Chad M.; French, Brian F. – Educational Assessment, 2020
The State of Washington requires school districts to file court petitions on students with excessive unexcused absences. The "Washington Assessment of Risks and Needs of Students" (WARNS), a self-report screening instrument developed for use by high school and juvenile court personnel in such situations, purports to measure six facets of…
Descriptors: Risk Assessment, Needs Assessment, Truancy, Measurement Techniques
Johnson, Evelyn S.; Moylan, Laura A.; Crawford, Angela; Zheng, Yuzhu – Reading & Writing Quarterly, 2019
In this study, we developed a Reading for Meaning special education teacher observation rubric that detailed the elements of evidence-based comprehension instruction and tested its psychometric properties using many-facet Rasch measurement. We collected video observations of classroom instruction from 10 special education teachers across 3 states…
Descriptors: Scoring Rubrics, Reading Comprehension, Special Education Teachers, Test Construction
Chin, Huan; Chew, Cheng Meng; Lim, Hooi Lian; Thien, Lei Mee – International Journal of Science and Mathematics Education, 2022
Cognitive Diagnostic Assessment (CDA) is an alternative assessment which can give a clear picture of pupils' learning process and cognitive structures to education stakeholders so that appropriate instructional strategies can be designed to tailored pupils' needs. Coincide with this function, the Ordered Multiple-Choice (OMC) items were…
Descriptors: Mathematics Instruction, Mathematics Tests, Multiple Choice Tests, Diagnostic Tests
Aloisi, Cesare; Callaghan, A. – Higher Education Pedagogies, 2018
The University of Reading Learning Gain project is a three-year longitudinal project to test and evaluate a range of available methodologies and to draw conclusions on what might be the right combination of instruments for the measurement of Learning Gain in higher education. This paper analyses the validity of a measure of critical thinking…
Descriptors: Foreign Countries, Cognitive Tests, Critical Thinking, Thinking Skills
Cromley, Jennifer G.; Dai, Ting; Fechter, Tia; Nelson, Frank E.; Van Boekel, Martin; Du, Yang – Grantee Submission, 2021
Making inferences and reasoning with new scientific information is critical for successful performance in biology coursework. Thus, identifying students who are weak in these skills could allow the early provision of additional support and course placement recommendations to help students develop their reasoning abilities, leading to better…
Descriptors: Science Tests, Multiple Choice Tests, Logical Thinking, Inferences
Grundin, Hans U. – Literacy, 2018
This paper aims to present a critical analysis of the Year 1 Phonics Screening Check (PSC), with special focus on the relationship between the UK Department for Education's policy-making and the evidence considered in the process of developing and evaluating the PSC. The reports from the in-house Standards and Testing Agency and from commissioned…
Descriptors: Foreign Countries, Criticism, Screening Tests, Phonics
Johnson, Evelyn S.; Moylan, Laura A.; Crawford, Angela; Zheng, Yuzhu – Grantee Submission, 2018
In this study, we developed a Reading for Meaning special education teacher observation rubric that details the elements of evidence-based comprehension instruction and tested its psychometric properties using many-faceted Rasch measurement (MFRM). Video observations of classroom instruction from 10 special education teachers across three states…
Descriptors: Scoring Rubrics, Reading Comprehension, Special Education Teachers, Test Construction
Davison, Mark L.; Biancarosa, Gina; Carlson, Sarah E.; Seipel, Ben; Liu, Bowen – Assessment for Effective Intervention, 2018
The computer-administered Multiple-Choice Online Causal Comprehension Assessment (MOCCA) for Grades 3 to 5 has an innovative, 40-item multiple-choice structure in which each distractor corresponds to a comprehension process upon which poor comprehenders have been shown to rely. This structure requires revised thinking about measurement issues…
Descriptors: Multiple Choice Tests, Computer Assisted Testing, Pilot Projects, Measurement
Davison, Mark L.; Biancarosa, Gina; Carlson, Sarah E.; Seipel, Ben; Liu, Bowen – Grantee Submission, 2018
The computer-administered Multiple-Choice Online Causal Comprehension Assessment (MOCCA) for Grades 3 to 5 has an innovative, 40-item multiple-choice structure in which each distractor corresponds to a comprehension process upon which poor comprehenders have been shown to rely. This structure requires revised thinking about measurement issues…
Descriptors: Multiple Choice Tests, Computer Assisted Testing, Pilot Projects, Measurement
Muijselaar, Marloes M. L. – Scientific Studies of Reading, 2018
We investigated the dimensionality of inference making in samples of 4- to 9-year-olds (Ns = 416-783) to determine if local and global coherence inferences could be distinguished. In addition, we examined the validity of our experimenter-developed inference measure by comparing with three additional measures of listening comprehension. Multitrait,…
Descriptors: Inferences, Thinking Skills, Young Children, Listening Comprehension