ERIC - Search Results

Publication Date

In 2025	2
Since 2024	2
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	14

Descriptor

Statistical Analysis	14
Test Interpretation	14
Scores	8
Test Items	6
Foreign Countries	5
Item Response Theory	5
Mathematics Tests	4
Achievement Tests	3
Reading Tests	3
Simulation	3
Academic Standards	2
Accuracy	2
Alignment (Education)	2
Classification	2
Correlation	2
Error of Measurement	2
Models	2
Reading Skills	2
Regression (Statistics)	2
Science Tests	2
Secondary School Mathematics	2
Secondary School Students	2
Student Evaluation	2
Test Use	2
Word Recognition	2
More ▼

Source

ACT, Inc.	1
Educational Assessment	1
Educational Psychology	1
European Journal of Science…	1
Grantee Submission	1
International Journal of…	1
Journal of Psychoeducational…	1
Oxford Review of Education	1
ProQuest LLC	1
Research Matters	1
School Psychology Review	1
Scientific Studies of Reading	1
Sociological Methods &…	1
Taiwan Journal of TESOL	1
More ▼

Publication Type

Journal Articles	11
Reports - Research	11
Dissertations/Theses -…	1
Guides - General	1
Numerical/Quantitative Data	1
Reports - Descriptive	1
Reports - Evaluative	1
Tests/Questionnaires	1

Education Level

Secondary Education	4
Higher Education	3
Postsecondary Education	3
Elementary Education	2
Early Childhood Education	1
Elementary Secondary Education	1
Grade 1	1
Grade 2	1
Junior High Schools	1
Middle Schools	1
Primary Education	1
More ▼

Audience

Location

United Kingdom (England)	2
Alabama	1
Indiana	1
Japan	1
Kansas	1
Massachusetts	1
Michigan	1
Minnesota	1
Netherlands	1
New Jersey	1
Ohio	1
Oregon	1
Vermont	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

ACT Assessment	1
Gray Oral Reading Test	1
Iowa Tests of Basic Skills	1
National Assessment of…	1
Peabody Individual…	1
Test of English for…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing all 14 results Save | Export

Interpretation Evidence for the Multidimensional Test Anxiety Scale: A Brief Report

Peer reviewed

Direct link

Gabrielle Francis; Nathaniel von der Embse; David Putwain; Eunsook Kim – Journal of Psychoeducational Assessment, 2025

Standardized testing is an integral part of the English and American education systems. However, the use of high-stakes testing has unintended consequences, one of which is test anxiety. Over the last 50 years, increased attention has been directed to developing tools to identify students experiencing test anxiety. However, many test anxiety…

Descriptors: Test Anxiety, Secondary School Students, Foreign Countries, Affective Measures

Which Assessment Is Harder? Some Limits of Statistical Linking

Download full text

Benton, Tom; Williamson, Joanna – Research Matters, 2022

Equating methods are designed to adjust between alternate versions of assessments targeting the same content at the same level, with the aim that scores from the different versions can be used interchangeably. The statistical processes used in equating have, however, been extended to statistically "link" assessments that differ, such as…

Descriptors: Statistical Analysis, Equated Scores, Definitions, Alternative Assessment

Development of the Quantitative Modelling Observation Protocol (QMOP) for Undergraduate Biology Courses: Validity Evidence for Score Interpretation and Uses

Peer reviewed

Direct link

Lyrica Lucas; Anum Khushal; Robert Mayes; Brian A. Couch; Joseph Dauer – International Journal of Science Education, 2025

Educational reform priorities such as emphasis on quantitative modelling (QM) have positioned undergraduate biology instructors as designers of QM experiences to engage students in authentic science practices that support the development of data-driven and evidence-based reasoning. Yet, little is known about how biology instructors adapt to the…

Descriptors: Undergraduate Students, College Science, Biology, Classroom Observation Techniques

Extension of Caution Indices to Mixed-Format Tests

Peer reviewed
PDF on ERIC

Download full text

Direct link

Sinharay, Sandip – Grantee Submission, 2018

Tatsuoka (1984) suggested several extended caution indices and their standardized versions that have been used as person-fit statistics by researchers such as Drasgow, Levine, and McLaughlin (1987), Glas and Meijer (2003), and Molenaar and Hoijtink (1990). However, these indices are only defined for tests with dichotomous items. This paper extends…

Descriptors: Test Format, Goodness of Fit, Item Response Theory, Error Patterns

Profile Analyses as Feedback by Evaluating the Balance in Exam Scores

Peer reviewed
PDF on ERIC

Download full text

Vaheoja, Monika; Verhelst, N. D.; Eggen, T.J.H.M. – European Journal of Science and Mathematics Education, 2019

In this article, the authors applied profile analysis to Maths exam data to demonstrate how different exam forms, differing in difficulty and length, can be reported and easily interpreted. The results were presented for different groups of participants and for different institutions in different Maths domains by evaluating the balance. Some…

Descriptors: Feedback (Response), Foreign Countries, Statistical Analysis, Scores

Statistical Classification for Cognitive Diagnostic Assessment: An Artificial Neural Network Approach

Peer reviewed

Direct link

Cui, Ying; Gierl, Mark; Guo, Qi – Educational Psychology, 2016

The purpose of the current investigation was to describe how the artificial neural networks (ANNs) can be used to interpret student performance on cognitive diagnostic assessments (CDAs) and evaluate the performances of ANNs using simulation results. CDAs are designed to measure student performance on problem-solving tasks and provide useful…

Descriptors: Cognitive Tests, Diagnostic Tests, Classification, Artificial Intelligence

How Does Polytomous Item Bias Affect Total-Group Survey Score Comparisons?

Peer reviewed

Direct link

Hidalgo, Ma Dolores; Benítez, Isabel; Padilla, Jose-Luis; Gómez-Benito, Juana – Sociological Methods & Research, 2017

The growing use of scales in survey questionnaires warrants the need to address how does polytomous differential item functioning (DIF) affect observed scale score comparisons. The aim of this study is to investigate the impact of DIF on the type I error and effect size of the independent samples t-test on the observed total scale scores. A…

Descriptors: Test Items, Test Bias, Item Response Theory, Surveys

Inter-Subject Comparability of Examination Standards in GCSE and GCE in England

Peer reviewed

Direct link

He, Qingping; Stockford, Ian; Meadows, Michelle – Oxford Review of Education, 2018

Results from Rasch analysis of GCSE and GCE A level data over a period of four years suggest that the standards of examinations in different subjects are not consistent in terms of the levels of the latent trait specified in the Rasch model required to achieve the same grades. Variability in statistical standards between subjects exists at both…

Descriptors: Foreign Countries, Exit Examinations, Intellectual Disciplines, Item Response Theory

Interpreting Reading Comprehension Test Results: Quantile Regression Shows That Explanatory Factors Can Vary with Performance Level

Peer reviewed

Direct link

Hua, Anh N.; Keenan, Janice M. – Scientific Studies of Reading, 2017

One of the most important findings to emerge from recent reading comprehension research is that there are large differences between tests in what they assess--specifically, the extent to which performance depends on word recognition versus listening comprehension skills. Because this research used ordinary least squares regression, it is not clear…

Descriptors: Reading Comprehension, Reading Tests, Test Interpretation, Regression (Statistics)

Innovative Assessments That Support Students' STEM Learning

Direct link

Thummaphan, Phonraphee – ProQuest LLC, 2017

The present study aimed to represent the innovative assessments that support students' learning in STEM education through using the integrative framework for Cognitive Diagnostic Modeling (CDM). This framework is based on three components, cognition, observation, and interpretation (National Research Council, 2001). Specifically, this dissertation…

Descriptors: STEM Education, Cognitive Processes, Observation, Psychometrics

ACT Reporting Category Interpretation Guide: Version 1.0. ACT Working Paper 2016 (05)

Download full text

Powers, Sonya; Li, Dongmei; Suh, Hongwook; Harris, Deborah J. – ACT, Inc., 2016

ACT reporting categories and ACT Readiness Ranges are new features added to the ACT score reports starting in fall 2016. For each reporting category, the number correct score, the maximum points possible, the percent correct, and the ACT Readiness Range, along with an indicator of whether the reporting category score falls within the Readiness…

Descriptors: Scores, Classification, College Entrance Examinations, Error of Measurement

Does Test Item Performance Increase with Test-to-Standards Alignment?

Peer reviewed

Direct link

Traynor, Anne – Educational Assessment, 2017

Variation in test performance among examinees from different regions or national jurisdictions is often partially attributed to differences in the degree of content correspondence between local school or training program curricula, and the test of interest. This posited relationship between test-curriculum correspondence, or "alignment,"…

Descriptors: Test Items, Test Construction, Alignment (Education), Curriculum

Evaluating the Interpretations and Use of Curriculum-Based Measurement in Reading and Word Lists for Universal Screening in First and Second Grade

Peer reviewed
PDF on ERIC

Download full text

January, Stacy-Ann A.; Ardoin, Scott P.; Christ, Theodore J.; Eckert, Tanya L.; White, Mary Jane – School Psychology Review, 2016

Universal screening in elementary schools often includes administering curriculum-based measurement in reading (CBM-R); but in first grade, nonsense word fluency (NWF) and, to a lesser extent, word identification fluency (WIF) are used because of concerns that CBM-R is too difficult for emerging readers. This study used Kane's argument-based…

Descriptors: Curriculum Based Assessment, Reading Tests, Test Interpretation, Test Use

Self-Assessment Accuracy: Correlations between Japanese English Learners' Self-Assessment on the CEFR--Japan's Can Do Statements and Scores on the TOEIC®

Peer reviewed
PDF on ERIC

Download full text

Runnels, Judith – Taiwan Journal of TESOL, 2016

Since its release in 1979 the TOEIC® (Test of English for International Communication) has been consistently and widely used by educational institutions and companies of Japan despite criticisms that it provides little useable information about language ability. In order to both reduce the extreme focus on and also aid with the practical…

Descriptors: Foreign Countries, English Curriculum, Language Tests, Second Language Learning

Anum Khushal	1
Ardoin, Scott P.	1
Benton, Tom	1
Benítez, Isabel	1
Brian A. Couch	1
Christ, Theodore J.	1
Cui, Ying	1
David Putwain	1
Eckert, Tanya L.	1
Eggen, T.J.H.M.	1
Eunsook Kim	1
Gabrielle Francis	1
Gierl, Mark	1
Guo, Qi	1
Gómez-Benito, Juana	1
Harris, Deborah J.	1
He, Qingping	1
Hidalgo, Ma Dolores	1
Hua, Anh N.	1
January, Stacy-Ann A.	1
Joseph Dauer	1
Keenan, Janice M.	1
Li, Dongmei	1
Lyrica Lucas	1
Meadows, Michelle	1
More ▼