Showing 166 to 180 of 3,974 results
Peer reviewed
Wise, Steven L. – Educational Measurement: Issues and Practice, 2017
The rise of computer-based testing has brought with it the capability to measure more aspects of a test event than simply the answers selected or constructed by the test taker. One behavior that has drawn much research interest is the time test takers spend responding to individual multiple-choice items. In particular, very short response…
Descriptors: Guessing (Tests), Multiple Choice Tests, Test Items, Reaction Time
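One common way to operationalize this idea is to flag responses faster than some threshold as rapid guesses and summarize each examinee's effort as the proportion of non-rapid responses. A minimal sketch in Python; the fixed 3-second threshold and the response times are assumptions for illustration, not values from the article:

# Hypothetical illustration: flagging rapid-guessing behavior with a fixed
# response-time threshold, in the spirit of the work described above.
def response_time_effort(response_times, threshold=3.0):
    """Return the proportion of items answered with solution behavior,
    i.e., response times at or above the rapid-guessing threshold."""
    solution = [t >= threshold for t in response_times]
    return sum(solution) / len(solution)

# One simulated test taker's per-item response times in seconds.
times = [12.4, 0.8, 45.1, 1.2, 30.0, 22.7, 0.9, 18.3]
print(f"Response-time effort: {response_time_effort(times):.2f}")  # 0.62 -> three rapid guesses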
Chamoy, Waritsa – ProQuest LLC, 2018
The main purpose of this study was to conduct a validation analysis of student surveys of teaching effectiveness implemented at Bangkok University, Thailand. The study included three phases: survey development, a pilot study, and a full implementation study. Four sources of validity evidence were collected to support intended interpretations and…
Descriptors: Foreign Countries, Psychometrics, Student Surveys, College Students
Peer reviewed
Willse, John T. – Measurement and Evaluation in Counseling and Development, 2017
This article provides a brief introduction to the Rasch model. Motivation for using Rasch analyses is provided. Important Rasch model concepts and key aspects of result interpretation are introduced, with major points reinforced using a simulation demonstration. Concrete guidelines are provided regarding sample size and the evaluation of items.
Descriptors: Item Response Theory, Test Results, Test Interpretation, Simulation
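For reference, the dichotomous Rasch model at the center of such introductions has the standard form (the notation here is generic, not taken from the article):

$$P(X_{pi} = 1 \mid \theta_p, b_i) = \frac{\exp(\theta_p - b_i)}{1 + \exp(\theta_p - b_i)}$$

where \theta_p is the ability of person p and b_i the difficulty of item i, both expressed on the same logit scale; when \theta_p = b_i, the probability of a correct response is exactly 0.5.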
Peer reviewed
Faulkner-Bond, Molly; Wolf, Mikyung Kim; Wells, Craig S.; Sireci, Stephen G. – Language Assessment Quarterly, 2018
In this study we investigated the internal factor structure of a large-scale K--12 assessment of English language proficiency (ELP) using samples of fourth- and eighth-grade English learners (ELs) in one state. While U.S. schools are mandated to measure students' ELP in four language domains (listening, reading, speaking, and writing), some ELP…
Descriptors: Factor Structure, Language Tests, Language Proficiency, Grade 4
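A common starting point for such an internal-structure analysis is a single-factor model in which each of the four domain scores loads on one overall proficiency factor; this specification is a generic illustration, not necessarily the model the authors retain:

$$x_d = \lambda_d \,\eta + \varepsilon_d, \qquad d \in \{\text{listening, reading, speaking, writing}\}$$

where \eta is overall ELP, \lambda_d the loading of domain d, and \varepsilon_d domain-specific error; the usual competing hypothesis is a correlated multi-factor structure with one factor per domain.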
Hopfenbeck, Therese N.; Lenkeit, Jenny – International Association for the Evaluation of Educational Achievement, 2018
International large-scale assessments (ILSAs) have had an increasing influence on the discourse surrounding education systems around the world. However, the results of these studies tend to have less impact on pedagogy in the classroom than would be expected. For example, a recent review of 114 published peer-reviewed articles on the IEA's…
Descriptors: Foreign Countries, Achievement Tests, Grade 4, Reading Achievement
Peer reviewed
Cui, Ying; Gierl, Mark; Guo, Qi – Educational Psychology, 2016
The purpose of the current investigation was to describe how artificial neural networks (ANNs) can be used to interpret student performance on cognitive diagnostic assessments (CDAs) and to evaluate the performance of ANNs using simulation results. CDAs are designed to measure student performance on problem-solving tasks and provide useful…
Descriptors: Cognitive Tests, Diagnostic Tests, Classification, Artificial Intelligence
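A toy version of the idea maps simulated item-response patterns to attribute-mastery profiles with a small feedforward network. The data-generating rules, Q-matrix, slip/guess rates, and architecture below are assumptions for illustration, not the authors' setup:

# Illustrative sketch only: a small multi-label network broadly analogous to
# the ANN approach described above; all values here are invented.
import numpy as np
from sklearn.neural_network import MLPClassifier

rng = np.random.default_rng(0)

# Simulate 500 examinees: 3 latent attributes, 8 items.
attributes = rng.integers(0, 2, size=(500, 3))
q_matrix = rng.integers(0, 2, size=(8, 3))          # which attributes each item requires
mastered = (attributes @ q_matrix.T) >= q_matrix.sum(axis=1)  # has all required attributes
p_correct = np.where(mastered, 0.9, 0.2)            # assumed slip/guess rates
responses = rng.binomial(1, p_correct)

# One hidden layer; one output probability per attribute (multi-label).
net = MLPClassifier(hidden_layer_sizes=(10,), max_iter=2000, random_state=0)
net.fit(responses, attributes)
print(net.predict(responses[:3]))   # estimated mastery profiles for the first examinees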
Peer reviewed
Hidalgo, Ma Dolores; Benítez, Isabel; Padilla, Jose-Luis; Gómez-Benito, Juana – Sociological Methods & Research, 2017
The growing use of scales in survey questionnaires warrants attention to how polytomous differential item functioning (DIF) affects observed scale score comparisons. The aim of this study is to investigate the impact of DIF on the type I error and effect size of the independent-samples t-test on observed total scale scores. A…
Descriptors: Test Items, Test Bias, Item Response Theory, Surveys
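The mechanism under study can be reproduced in a rough simulation: hold the latent trait distribution equal across groups, bias a few items against one group, and count how often a t-test on total scores rejects. All parameter values below are illustrative assumptions, not the study's design:

# Rough sketch: both groups share the same latent distribution, so any
# systematic total-score difference the t-test detects is an artifact of DIF.
import numpy as np
from scipy.stats import ttest_ind

rng = np.random.default_rng(1)
n, n_items, n_dif = 200, 20, 4
rejections = 0
for _ in range(1000):
    theta_ref = rng.normal(0, 1, n)
    theta_foc = rng.normal(0, 1, n)          # same latent mean: true effect is zero
    b = rng.normal(0, 1, n_items)
    b_foc = b.copy()
    b_foc[:n_dif] += 0.6                     # uniform DIF: harder items for the focal group
    p_ref = 1 / (1 + np.exp(-(theta_ref[:, None] - b)))
    p_foc = 1 / (1 + np.exp(-(theta_foc[:, None] - b_foc)))
    totals_ref = rng.binomial(1, p_ref).sum(axis=1)
    totals_foc = rng.binomial(1, p_foc).sum(axis=1)
    if ttest_ind(totals_ref, totals_foc).pvalue < 0.05:
        rejections += 1
print(f"Empirical type I error: {rejections / 1000:.3f}")  # inflated well above .05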
Peer reviewed
Oliveri, Maria; McCaffrey, Daniel; Ezzo, Chelsea; Holtzman, Steven – Applied Measurement in Education, 2017
The assessment of noncognitive traits is challenging due to possible response biases, "subjectivity" and "faking." Standardized third-party evaluations where an external evaluator rates an applicant on their strengths and weaknesses on various noncognitive traits are a promising alternative. However, accurate score-based…
Descriptors: Factor Analysis, Decision Making, College Admission, Likert Scales
Peer reviewed
Zapata-Rivera, Juan Diego; Katz, Irvin R. – Assessment in Education: Principles, Policy & Practice, 2014
Score reports have one or more intended audiences: the people who use the reports to make decisions about test takers, including teachers, administrators, parents and test takers. Attention to audience when designing a score report supports assessment validity by increasing the likelihood that score users will interpret and use assessment results…
Descriptors: Audience Analysis, Scores, Reports, Test Interpretation
Peer reviewed
He, Qingping; Stockford, Ian; Meadows, Michelle – Oxford Review of Education, 2018
Results from Rasch analysis of GCSE and GCE A level data over a period of four years suggest that the standards of examinations in different subjects are not consistent in terms of the levels of the latent trait specified in the Rasch model required to achieve the same grades. Variability in statistical standards between subjects exists at both…
Descriptors: Foreign Countries, Exit Examinations, Intellectual Disciplines, Item Response Theory
Peer reviewed
Hua, Anh N.; Keenan, Janice M. – Scientific Studies of Reading, 2017
One of the most important findings to emerge from recent reading comprehension research is that there are large differences between tests in what they assess--specifically, the extent to which performance depends on word recognition versus listening comprehension skills. Because this research used ordinary least squares regression, it is not clear…
Descriptors: Reading Comprehension, Reading Tests, Test Interpretation, Regression (Statistics)
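The regression in question typically takes reading comprehension as the outcome with word recognition and listening comprehension as predictors, in the spirit of the "simple view of reading." A minimal OLS sketch; the data and generating weights below are fabricated for illustration:

# Hedged sketch of the OLS decomposition described above.
import numpy as np

rng = np.random.default_rng(2)
n = 300
word_rec = rng.normal(0, 1, n)
listening = rng.normal(0, 1, n)
# Assumed generating weights; real tests differ in how heavily each skill counts.
comprehension = 0.6 * word_rec + 0.4 * listening + rng.normal(0, 0.5, n)

X = np.column_stack([np.ones(n), word_rec, listening])
coefs, *_ = np.linalg.lstsq(X, comprehension, rcond=None)
print(f"intercept={coefs[0]:.2f}, word recognition={coefs[1]:.2f}, "
      f"listening={coefs[2]:.2f}")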
Peer reviewed
Newton, Paul E. – Journal of Educational Measurement, 2013
Kane distinguishes between two kinds of argument: the interpretation/use argument and the validity argument. This commentary considers whether there really are two kinds of argument, two arguments, or just one. It concludes that there is just one argument: the validity argument. (Contains 2 figures and 5 notes.)
Descriptors: Validity, Test Interpretation, Test Use
Talan, Teri N.; Bloom, Paula Jorde – Teachers College Press, 2018
The "Business Administration Scale for Family Child Care" (BAS) is the first valid and reliable tool for measuring and improving the overall quality of business and professional practices in family child care settings. It is applicable for multiple uses, including program self-improvement, technical assistance and monitoring, training,…
Descriptors: Business Administration, Child Care, Rating Scales, Qualifications
Peer reviewed
Popham, W. James – Educational Leadership, 2014
Fifty years ago, Robert Glaser introduced the concept of criterion-referenced measurement in an article in American Psychologist. Its early proponents predicted that this measurement strategy would revolutionize education. But has it lived up to its promise? W. James Popham explores this question by looking at the history of criterion-referenced…
Descriptors: Criterion Referenced Tests, Program Effectiveness, Misconceptions, Test Interpretation
Peer reviewed
PDF on ERIC
Monroe, Scott; Cai, Li – Grantee Submission, 2015
This research is concerned with two topics in assessing model fit for categorical data analysis. The first topic involves the application of a limited-information overall test, introduced in the item response theory literature, to Structural Equation Modeling (SEM) of categorical outcome variables. Most popular SEM test statistics assess how well…
Descriptors: Structural Equation Models, Test Interpretation, Goodness of Fit, Item Response Theory
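The limited-information overall test most often used in this literature is the M_2 statistic of Maydeu-Olivares and Joe, which compares observed and model-implied first- and second-order margins rather than the full contingency table; the abstract does not name the statistic, so the form below is the standard one rather than a quotation from the paper:

$$M_2 = N\,(\mathbf{p}_2 - \hat{\boldsymbol{\pi}}_2)^{\top}\,\hat{\mathbf{C}}_2\,(\mathbf{p}_2 - \hat{\boldsymbol{\pi}}_2)$$

where N is the sample size, \mathbf{p}_2 stacks the observed univariate and bivariate margins, \hat{\boldsymbol{\pi}}_2 the model-implied ones, and \hat{\mathbf{C}}_2 is a weight matrix chosen so that M_2 is asymptotically chi-squared distributed under the fitted model.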