ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	6
Since 2016 (last 10 years)	25
Since 2006 (last 20 years)	44

Descriptor

Scaling	129
Scoring	129
Test Construction	42
Item Response Theory	34
Test Items	31
Test Reliability	29
Educational Assessment	28
Test Validity	27
Academic Achievement	26
Scores	26
Testing Programs	24
Achievement Tests	23
Elementary Secondary Education	23
Equated Scores	22
Student Evaluation	19
Item Analysis	18
Testing	17
Grade 4	16
Grade 6	15
Test Results	15
Data Collection	14
Foreign Countries	14
Grade 8	14
Higher Education	14
Psychometrics	14

Education Level

Elementary Education	17
Secondary Education	16
Grade 4	13
Grade 6	13
Intermediate Grades	13
Elementary Secondary Education	12
Middle Schools	12
Junior High Schools	11
Early Childhood Education	10
Grade 3	10
Grade 5	10
Primary Education	10
Grade 7	9
Grade 8	9
High Schools	9
Grade 9	4
Grade 10	3
Adult Education	2
Grade 11	2
Higher Education	2
Postsecondary Education	2
Grade 12	1
High School Equivalency…	1

Audience

Researchers	13
Teachers	2
Policymakers	1
Practitioners	1
Students	1

Location

Australia	9
New York	5
United States	3
Austria	1
Belgium	1
California	1
Canada	1
Chile	1
China	1
Cyprus	1
Czech Republic	1
Denmark	1
Estonia	1
Florida	1
France	1
Germany	1
India	1
Ireland	1
Italy	1
Japan	1
Kentucky	1
Netherlands	1
North Carolina	1
Norway	1
Panama	1

Laws, Policies, & Programs

Individuals with Disabilities…	2
Education Consolidation…	1
Every Student Succeeds Act…	1
Kentucky Education Reform Act…	1
No Child Left Behind Act 2001	1

Showing 1 to 15 of 129 results

On the Merits of Longitudinal Multiple Group Modelling: An Alternative to Multilevel Modelling for Intervention Evaluations

Peer reviewed

Little, Todd D.; Bontempo, Daniel; Rioux, Charlie; Tracy, Allison – International Journal of Research & Method in Education, 2022

Multilevel modelling (MLM) is the most frequently used approach for evaluating interventions with clustered data. MLM, however, has some limitations that are associated with numerous obstacles to model estimation and valid inferences. Longitudinal multiple-group (LMG) modelling is a longstanding approach for testing intervention effects using…

Descriptors: Longitudinal Studies, Hierarchical Linear Modeling, Alternative Assessment, Intervention

Impact of Categorization and Scaling on Classification Agreement and Prediction Accuracy Statistics. Research Report. ETS RR-21-26

Peer reviewed

Wang, Wei; Dorans, Neil J. – ETS Research Report Series, 2021

Agreement statistics and measures of prediction accuracy are often used to assess the quality of two measures of a construct. Agreement statistics are appropriate for measures that are supposed to be interchangeable, whereas prediction accuracy statistics are appropriate for situations where one variable is the target and the other variables are…

Descriptors: Classification, Scaling, Prediction, Accuracy

Maintaining Score Scales over Time: A Comparison of Five Scoring Methods

Peer reviewed

Kim, Stella Yun; Lee, Won-Chan – Applied Measurement in Education, 2023

This study evaluates various scoring methods including number-correct scoring, IRT theta scoring, and hybrid scoring in terms of scale-score stability over time. A simulation study was conducted to examine the relative performance of five scoring methods in terms of preserving the first two moments of scale scores for a population in a chain of…

Descriptors: Scoring, Comparative Analysis, Item Response Theory, Simulation

Basic Concepts of Item Response Theory: A Nonmathematical Introduction. Research Memorandum. ETS RM-20-06

Livingston, Samuel A. – Educational Testing Service, 2020

This booklet is a conceptual introduction to item response theory (IRT), which many large-scale testing programs use for constructing and scoring their tests. Although IRT is essentially mathematical, the approach here is nonmathematical, in order to serve as an introduction on the topic for people who want to understand why IRT is used and what…

Descriptors: Item Response Theory, Scoring, Test Items, Scaling

A Mokken Scale Analysis of the Last Series of the Standard Progressive Matrices (SPM-LS)

Peer reviewed

Myszkowski, Nils – Journal of Intelligence, 2020

Raven's Standard Progressive Matrices (Raven 1941) is a widely used 60-item long measure of general mental ability. It was recently suggested that, for situations where taking this test is too time consuming, a shorter version, comprised of only the last series of the Standard Progressive Matrices (Myszkowski and Storme 2018) could be used, while…

Descriptors: Intelligence Tests, Psychometrics, Nonparametric Statistics, Item Response Theory

Peer Assessment in MOOCs: Systematic Literature Review

Peer reviewed

Gamage, Dilrukshi; Staubitz, Thomas; Whiting, Mark – Distance Education, 2021

We report on a systematic review of the landscape of peer assessment in massive open online courses (MOOCs) with papers from 2014 to 2020 in 20 leading education technology publication venues across four databases containing education technology-related papers, addressing three research issues: the evolution of peer assessment in MOOCs during the…

Descriptors: Peer Evaluation, Evaluation Methods, Online Courses, Large Group Instruction

A Comparison of Methodologies for Scaling Longitudinal Social-Emotional Survey Responses

Peer reviewed

Soland, James; Kuhfeld, Megan; Register, Brennan – Educational Assessment, 2023

Much of what we know about how children develop is based on survey data. In order to estimate growth across time and, thereby, better understand that development, short survey scales are typically administered at repeated timepoints. Before estimating growth, those repeated measures must be put onto the same scale. Yet, little research examines…

Descriptors: Comparative Analysis, Social Emotional Learning, Scaling, Effect Size

Unidimensional Vertical Scaling in Multidimensional Space. Research Report. ETS RR-17-29

Peer reviewed

Carlson, James E. – ETS Research Report Series, 2017

In this paper, I consider a set of test items that are located in a multidimensional space, S[subscript M], but are located along a curved line in S[subscript M] and can be scaled unidimensionally. Furthermore, I am demonstrating a case in which the test items are administered across 6 levels, such as occurs in K-12 assessment across 6 grade…

Descriptors: Test Items, Item Response Theory, Difficulty Level, Scoring

Modeling of Item Response Functions under the D-Scoring Method

Peer reviewed

Dimitrov, Dimiter M. – Educational and Psychological Measurement, 2020

This study presents new models for item response functions (IRFs) in the framework of the D-scoring method (DSM) that is gaining attention in the field of educational and psychological measurement and large-scale assessments. In a previous work on DSM, the IRFs of binary items were estimated using a logistic regression model (LRM). However, the LRM…

Descriptors: Item Response Theory, Scoring, True Scores, Scaling

U.S. Technical Report and User Guide for the 2019 Trends in International Mathematics and Science Study (TIMSS). Part 1. NCES 2022-049

Peer reviewed

Egan, Laura; Tang, Judy H.; Ferraro, David; Erberber, Ebru; Tsokodayi, Yemurai; Stearns, Pat – National Center for Education Statistics, 2022

Trends in International Mathematics and Science Study (TIMSS) is an international comparative study designed to measure trends in mathematics and science achievement at grades 4 and 8, as well as to collect information about educational contexts (such as students' schools, teachers, and homes) that may be related to student achievement. TIMSS has…

Descriptors: Achievement Tests, Mathematics Achievement, International Assessment, Foreign Countries

U.S. PIRLS and ePIRLS 2016 Technical Report and User's Guide. NCES 2019-113

Peer reviewed

Herget, Debbie; Dalton, Ben; Kinney, Saki; Smith, W. Zachary; Wilson, David; Rogers, Jim – National Center for Education Statistics, 2019

The Progress in International Reading Literacy Study (PIRLS) is an international comparative study of student performance in reading literacy at the fourth grade. PIRLS 2016 marks the fourth iteration of the study, which has been conducted every 5 years since 2001. New to the PIRLS assessment in 2016, ePIRLS provides a computer-based extension to…

Descriptors: Achievement Tests, Grade 4, Reading Achievement, Foreign Countries

Toward Education Quality Improvement in China: A Brief Overview of the National Assessment of Education Quality

Peer reviewed

Jiang, Yu; Zhang, Jiahui; Xin, Tao – Journal of Educational and Behavioral Statistics, 2019

This article is an overview of the National Assessment of Education Quality (NAEQ) of China in reading, mathematics, sciences, arts, physical education, and moral education at Grades 4 and 8. After a review of the background and history of NAEQ, we present the assessment framework with students' holistic development at the core and the design for…

Descriptors: Foreign Countries, Educational Quality, Educational Improvement, National Competency Tests

Social Communication Questionnaire Scoring Procedures for Autism Spectrum Disorder and the Prevalence of Potential Social Communication Disorder in ASD

Peer reviewed

Barnard-Brak, Lucy; Richman, David M.; Chesnut, Steven Randall; Little, Todd D. – School Psychology Quarterly, 2016

In analyzing data from the National Database for Autism Research, we utilized Mokken scaling techniques as a means of creating a more effective and efficient screening procedure for autism spectrum disorder (ASD) via the Social Communication Questionnaire (SCQ). With a sample of 1,040, approximately 80% (n = 827) of the sample were males while…

Descriptors: Autism, Pervasive Developmental Disorders, Communication Problems, Screening Tests

Diagnosing Conceptions about the Epistemology of Science: Contributions of a Quantitative Assessment Methodology

Peer reviewed

Vázquez-Alonso, Ángel; Manassero-Mas, María-Antonia; García-Carmona, Antonio; Montesano de Talavera, Marisa – Asia-Pacific Forum on Science Learning and Teaching, 2016

This study applies a new quantitative methodological approach to diagnose epistemology conceptions in a large sample. The analyses use seven multiple-rating items on the epistemology of science drawn from the item pool Views on Science-Technology-Society (VOSTS). The bases of the new methodological diagnostic approach are the empirical…

Descriptors: Epistemology, Statistical Analysis, Science and Society, Scientific Principles

Elementary Mathematics Student Assessment: Measuring the Performance of Grade 3, 4, and 5 Students in Number (Whole Numbers and Fractions), Operations, and Algebraic Thinking in Fall 2015. Research Report No. 2018-24

Schoen, Robert C.; Anderson, Daniel; Riddell, Claire M.; Bauduin, Charity – Online Submission, 2018

This report provides a description of the development process, field testing, and psychometric properties of the fall 2015 grades 3-5 Elementary Mathematics Student Assessment (EMSA), a student mathematics test designed to be administered in a whole-group setting to students in grades 3, 4, and 5. The test was administered to 2,614 participating…

Descriptors: Elementary School Students, Elementary School Mathematics, Grade 3, Grade 4

Source

Applied Psychological…	5
Educational Measurement:…	5
Educational and Psychological…	5
Ministerial Council on…	5
Journal of Educational…	4
New York State Education…	4
Partnership for Assessment of…	4
ETS Research Report Series	3
National Center for Education…	3
Studies in Educational…	3
Applied Measurement in…	2
Educational Assessment	2
Educational Testing Service	2
Evaluation and the Health…	2
Multivariate Behavioral…	2
New Meridian Corporation	2
ACT, Inc.	1
American Journal on Mental…	1
Asia-Pacific Forum on Science…	1
College Entrance Examination…	1
Distance Education	1
Evaluation Review	1
Evaluation and Program…	1
Exceptional Children	1
Foreign Language Annals	1

Author

Donovan, Jenny	3
Kolen, Michael J.	3
Lennon, Melissa	3
Yen, Wendy M.	3
Beaton, Albert E.	2
Bock, R. Darrell	2
Braungart-Bloom, Diane S.	2
Cook, Linda L.	2
Dorans, Neil J.	2
Hutton, Penny	2
Johnson, Eugene G.	2
Lee, Won-Chan	2
Little, Todd D.	2
Livingston, Samuel A.	2
Luecht, Richard M.	2
Morrissey, Noni	2
O'Connor, Gayl	2
Sireci, Stephen G.	2
Zwick, Rebecca	2
Aiken, Leona S.	1
Allen, Nancy L.	1
Amsbary, Michelle	1
Anderson, Daniel	1
Anderson, David O.	1

Publication Type

Journal Articles	50
Reports - Evaluative	46
Reports - Research	39
Speeches/Meeting Papers	22
Reports - Descriptive	21
Numerical/Quantitative Data	19
Tests/Questionnaires	17
Guides - Non-Classroom	6
Opinion Papers	5
Collected Works - General	4
Information Analyses	4
Books	3
Reports - General	2
Collected Works - Proceedings	1
Collected Works - Serials	1
Dissertations/Theses -…	1
ERIC Digests in Full Text	1
ERIC Publications	1
Guides - Classroom - Learner	1
Historical Materials	1

Assessments and Surveys

National Assessment of…	9
ACT Assessment	3
Program for International…	3
ACT Interest Inventory	2
General Educational…	2
Iowa Tests of Basic Skills	2
SAT (College Admission Test)	2
Bayley Scales of Infant…	1
Beck Depression Inventory	1
California Achievement Tests	1
College Board Achievement…	1
Comprehensive Tests of Basic…	1
Kaufman Test of Educational…	1
Metropolitan Achievement Tests	1
National Assessment of Adult…	1
National Teacher Examinations	1
Progress in International…	1
Raven Progressive Matrices	1
Test of English as a Foreign…	1
Test of Written English	1
Trends in International…	1