ERIC - Search Results

Publication Date

In 2025	2
Since 2024	2
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	14
Since 2006 (last 20 years)	29

Descriptor

Statistical Analysis	295
Test Interpretation	295
Scores	65
Test Reliability	55
Test Results	51
Comparative Analysis	50
Test Construction	47
Test Validity	47
Achievement Tests	43
Evaluation Methods	41
Testing	41
Correlation	39
Mathematical Models	37
Item Analysis	33
Academic Achievement	30
Measurement Techniques	30
Research Methodology	30
Test Items	27
Testing Problems	27
Criterion Referenced Tests	25
Elementary Secondary Education	25
Scoring	25
Standardized Tests	25
Psychometrics	23
Student Evaluation	22
More ▼

Education Level

Higher Education	8
Postsecondary Education	6
Secondary Education	6
Elementary Education	4
Elementary Secondary Education	4
Middle Schools	3
Junior High Schools	2
Early Childhood Education	1
Grade 1	1
Grade 2	1
Grade 7	1
Primary Education	1
More ▼

Audience

Researchers	11
Practitioners	6
Teachers	3
Students	2
Administrators	1
Parents	1
Policymakers	1

Location

Michigan	5
Pennsylvania	3
United Kingdom	3
California	2
New Jersey	2
United Kingdom (England)	2
Alabama	1
Australia	1
California (Berkeley)	1
California (Stanford)	1
Colorado (Denver)	1
Delaware	1
District of Columbia	1
Florida	1
Hawaii	1
Indiana	1
Iran	1
Italy	1
Japan	1
Kansas	1
Massachusetts	1
Minnesota	1
Missouri (Saint Louis)	1
Netherlands	1
Ohio	1
More ▼

Laws, Policies, & Programs

Elementary and Secondary…	4
Individuals with Disabilities…	1
No Child Left Behind Act 2001	1

What Works Clearinghouse Rating

Showing 1 to 15 of 295 results Save | Export

Interpretation Evidence for the Multidimensional Test Anxiety Scale: A Brief Report

Peer reviewed

Direct link

Gabrielle Francis; Nathaniel von der Embse; David Putwain; Eunsook Kim – Journal of Psychoeducational Assessment, 2025

Standardized testing is an integral part of the English and American education systems. However, the use of high-stakes testing has unintended consequences, one of which is test anxiety. Over the last 50 years, increased attention has been directed to developing tools to identify students experiencing test anxiety. However, many test anxiety…

Descriptors: Test Anxiety, Secondary School Students, Foreign Countries, Affective Measures

Which Assessment Is Harder? Some Limits of Statistical Linking

Download full text

Benton, Tom; Williamson, Joanna – Research Matters, 2022

Equating methods are designed to adjust between alternate versions of assessments targeting the same content at the same level, with the aim that scores from the different versions can be used interchangeably. The statistical processes used in equating have, however, been extended to statistically "link" assessments that differ, such as…

Descriptors: Statistical Analysis, Equated Scores, Definitions, Alternative Assessment

Development of the Quantitative Modelling Observation Protocol (QMOP) for Undergraduate Biology Courses: Validity Evidence for Score Interpretation and Uses

Peer reviewed

Direct link

Lyrica Lucas; Anum Khushal; Robert Mayes; Brian A. Couch; Joseph Dauer – International Journal of Science Education, 2025

Educational reform priorities such as emphasis on quantitative modelling (QM) have positioned undergraduate biology instructors as designers of QM experiences to engage students in authentic science practices that support the development of data-driven and evidence-based reasoning. Yet, little is known about how biology instructors adapt to the…

Descriptors: Undergraduate Students, College Science, Biology, Classroom Observation Techniques

Extension of Caution Indices to Mixed-Format Tests

Peer reviewed
PDF on ERIC

Download full text

Direct link

Sinharay, Sandip – Grantee Submission, 2018

Tatsuoka (1984) suggested several extended caution indices and their standardized versions that have been used as person-fit statistics by researchers such as Drasgow, Levine, and McLaughlin (1987), Glas and Meijer (2003), and Molenaar and Hoijtink (1990). However, these indices are only defined for tests with dichotomous items. This paper extends…

Descriptors: Test Format, Goodness of Fit, Item Response Theory, Error Patterns

Profile Analyses as Feedback by Evaluating the Balance in Exam Scores

Peer reviewed
PDF on ERIC

Download full text

Vaheoja, Monika; Verhelst, N. D.; Eggen, T.J.H.M. – European Journal of Science and Mathematics Education, 2019

In this article, the authors applied profile analysis to Maths exam data to demonstrate how different exam forms, differing in difficulty and length, can be reported and easily interpreted. The results were presented for different groups of participants and for different institutions in different Maths domains by evaluating the balance. Some…

Descriptors: Feedback (Response), Foreign Countries, Statistical Analysis, Scores

Statistical Classification for Cognitive Diagnostic Assessment: An Artificial Neural Network Approach

Peer reviewed

Direct link

Cui, Ying; Gierl, Mark; Guo, Qi – Educational Psychology, 2016

The purpose of the current investigation was to describe how the artificial neural networks (ANNs) can be used to interpret student performance on cognitive diagnostic assessments (CDAs) and evaluate the performances of ANNs using simulation results. CDAs are designed to measure student performance on problem-solving tasks and provide useful…

Descriptors: Cognitive Tests, Diagnostic Tests, Classification, Artificial Intelligence

How Does Polytomous Item Bias Affect Total-Group Survey Score Comparisons?

Peer reviewed

Direct link

Hidalgo, Ma Dolores; Benítez, Isabel; Padilla, Jose-Luis; Gómez-Benito, Juana – Sociological Methods & Research, 2017

The growing use of scales in survey questionnaires warrants the need to address how does polytomous differential item functioning (DIF) affect observed scale score comparisons. The aim of this study is to investigate the impact of DIF on the type I error and effect size of the independent samples t-test on the observed total scale scores. A…

Descriptors: Test Items, Test Bias, Item Response Theory, Surveys

Inter-Subject Comparability of Examination Standards in GCSE and GCE in England

Peer reviewed

Direct link

He, Qingping; Stockford, Ian; Meadows, Michelle – Oxford Review of Education, 2018

Results from Rasch analysis of GCSE and GCE A level data over a period of four years suggest that the standards of examinations in different subjects are not consistent in terms of the levels of the latent trait specified in the Rasch model required to achieve the same grades. Variability in statistical standards between subjects exists at both…

Descriptors: Foreign Countries, Exit Examinations, Intellectual Disciplines, Item Response Theory

Interpreting Reading Comprehension Test Results: Quantile Regression Shows That Explanatory Factors Can Vary with Performance Level

Peer reviewed

Direct link

Hua, Anh N.; Keenan, Janice M. – Scientific Studies of Reading, 2017

One of the most important findings to emerge from recent reading comprehension research is that there are large differences between tests in what they assess--specifically, the extent to which performance depends on word recognition versus listening comprehension skills. Because this research used ordinary least squares regression, it is not clear…

Descriptors: Reading Comprehension, Reading Tests, Test Interpretation, Regression (Statistics)

Innovative Assessments That Support Students' STEM Learning

Direct link

Thummaphan, Phonraphee – ProQuest LLC, 2017

The present study aimed to represent the innovative assessments that support students' learning in STEM education through using the integrative framework for Cognitive Diagnostic Modeling (CDM). This framework is based on three components, cognition, observation, and interpretation (National Research Council, 2001). Specifically, this dissertation…

Descriptors: STEM Education, Cognitive Processes, Observation, Psychometrics

Enhancing the Interpretability of the Overall Results of an International Test of English-Language Proficiency

Peer reviewed

Direct link

Papageorgiou, Spiros; Morgan, Rick; Becker, Valerie – International Journal of Testing, 2015

The purpose of this study was to enhance the meaning of the scores of an English-language test by developing performance levels and descriptors for reporting overall test performance. The levels and descriptors were intended to accompany the total scale scores of TOEFL Junior® Standard, an international test of English as a second/foreign…

Descriptors: Language Proficiency, Language Tests, English (Second Language), Second Language Learning

Generalizability Theory as a Unifying Framework of Measurement Reliability in Adolescent Research

Peer reviewed

Direct link

Fan, Xitao; Sun, Shaojing – Journal of Early Adolescence, 2014

In adolescence research, the treatment of measurement reliability is often fragmented, and it is not always clear how different reliability coefficients are related. We show that generalizability theory (G-theory) is a comprehensive framework of measurement reliability, encompassing all other reliability methods (e.g., Pearson "r,"…

Descriptors: Generalizability Theory, Measurement, Reliability, Correlation

ACT Reporting Category Interpretation Guide: Version 1.0. ACT Working Paper 2016 (05)

Download full text

Powers, Sonya; Li, Dongmei; Suh, Hongwook; Harris, Deborah J. – ACT, Inc., 2016

ACT reporting categories and ACT Readiness Ranges are new features added to the ACT score reports starting in fall 2016. For each reporting category, the number correct score, the maximum points possible, the percent correct, and the ACT Readiness Range, along with an indicator of whether the reporting category score falls within the Readiness…

Descriptors: Scores, Classification, College Entrance Examinations, Error of Measurement

Does Test Item Performance Increase with Test-to-Standards Alignment?

Peer reviewed

Direct link

Traynor, Anne – Educational Assessment, 2017

Variation in test performance among examinees from different regions or national jurisdictions is often partially attributed to differences in the degree of content correspondence between local school or training program curricula, and the test of interest. This posited relationship between test-curriculum correspondence, or "alignment,"…

Descriptors: Test Items, Test Construction, Alignment (Education), Curriculum

Is What You See What You Really Get? Comparison of Scoring Techniques in the Assessment of Real-World Divergent Thinking

Peer reviewed

Direct link

Plucker, Jonathan A.; Qian, Meihua; Schmalensee, Stephanie L. – Creativity Research Journal, 2014

In recent years, the social sciences have seen a resurgence in the study of divergent thinking (DT) measures. However, many of these recent advances have focused on abstract, decontextualized DT tasks (e.g., list as many things as you can think of that have wheels). This study provides a new perspective by exploring the reliability and validity…

Descriptors: Creative Thinking, Creativity Tests, Scoring Formulas, Evaluation Methods

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 20

Educational and Psychological…	9
Test Service Bulletin	7
Journal of Educational…	5
Psychology in the Schools	5
Psychometrika	5
Journal of Clinical Psychology	4
Journal of Special Education	4
Applied Psychological…	3
Intelligence	3
Journal of School Psychology	3
Journal of Educational…	2
Online Submission	2
Psychological Assessment	2
Research Papers in Education	2
ACT, Inc.	1
American Journal of Physics	1
American Journal on Mental…	1
Applied Measurement in…	1
Audio-Visual Language Journal	1
Australian Journal of…	1
Canadian Modern Language…	1
College Entrance Examination…	1
College Student Journal	1
College and University	1
Council for Aid to Education	1
More ▼

Brennan, Robert L.	4
Reynolds, Cecil R.	4
Reilly, Richard R.	3
Shoemaker, David M.	3
Thompson, Bruce	3
Barker, Pierce	2
Besel, Ronald	2
Bock, R. Darrell	2
Boldt, Robert F.	2
Borich, Gary D.	2
Cliff, Norman	2
Epstein, Kenneth I.	2
Frary, Robert B.	2
Garvin, Alfred D.	2
Hambleton, Ronald K.	2
Lawson, Edwin D.	2
Lindsay, Carl A.	2
McKinley, Robert L.	2
Mislevy, Robert J.	2
Myers, Charles T.	2
Pelavin, Sol H.	2
Prichard, Mark A.	2
Reckase, Mark D.	2
Silver, Stephen J.	2
More ▼

Reports - Research	124
Journal Articles	74
Speeches/Meeting Papers	39
Reports - Evaluative	27
Reports - Descriptive	13
Guides - Non-Classroom	7
Opinion Papers	7
Reports - General	6
Guides - General	5
Information Analyses	5
Numerical/Quantitative Data	5
Tests/Questionnaires	5
Guides - Classroom - Learner	4
Books	3
ERIC Digests in Full Text	3
ERIC Publications	3
Collected Works - Proceedings	1
Collected Works - Serials	1
Computer Programs	1
Dissertations/Theses -…	1
Guides - Classroom - Teacher	1
Legal/Legislative/Regulatory…	1
Non-Print Media	1
Reference Materials -…	1
More ▼

Wechsler Intelligence Scale…	7
Metropolitan Achievement Tests	6
SAT (College Admission Test)	5
Test of English as a Foreign…	5
ACT Assessment	3
California Achievement Tests	3
Comprehensive Tests of Basic…	3
Graduate Record Examinations	3
National Assessment of…	3
Armed Services Vocational…	2
Iowa Tests of Basic Skills	2
Minnesota Multiphasic…	2
Peabody Individual…	2
Stanford Achievement Tests	2
Stanford Binet Intelligence…	2
Strong Campbell Interest…	2
Strong Vocational Interest…	2
Adjective Check List	1
College Board Achievement…	1
Edwards Personal Preference…	1
Embedded Figures Test	1
Family Environment Scale	1
Gates MacGinitie Reading Tests	1
Gray Oral Reading Test	1
Group Embedded Figures Test	1
More ▼