Showing 1 to 15 of 49 results
Peer reviewed | PDF on ERIC (full text available)
Ozsoy, Seyma Nur; Kilmen, Sevilay – International Journal of Assessment Tools in Education, 2023
In this study, Kernel test equating methods were compared under NEAT and NEC designs. In the NEAT design, Kernel post-stratification and chain equating methods using optimal and large bandwidths were compared. In the NEC design, gender and/or computer/tablet use was treated as a covariate, and Kernel test equating methods were…
Descriptors: Equated Scores, Testing, Test Items, Statistical Analysis
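The kernel equating machinery compared in the entry above can be illustrated with a short, hedged sketch: Gaussian kernel continuization of two discrete score distributions followed by an equipercentile mapping under an equivalent-groups layout. The fixed bandwidths and toy score distributions below are assumptions for illustration, not values or designs from the study.

```python
# Hypothetical sketch of Gaussian kernel continuization + equipercentile equating.
# Bandwidths are fixed by hand here; the study compares optimal vs. large bandwidths.
import numpy as np
from scipy.stats import norm
from scipy.optimize import brentq

def kernel_cdf(x, scores, probs, h):
    """Continuized CDF of a discrete score distribution (Gaussian kernel)."""
    mu = np.sum(probs * scores)
    var = np.sum(probs * (scores - mu) ** 2)
    a = np.sqrt(var / (var + h ** 2))      # constant that preserves mean and variance
    return np.sum(probs * norm.cdf((x - a * scores - (1 - a) * mu) / (a * h)))

def equate(x, scores_x, probs_x, scores_y, probs_y, hx=0.6, hy=0.6):
    """Map a form-X score to the form-Y scale: e_Y(x) = G^{-1}(F(x))."""
    p = kernel_cdf(x, scores_x, probs_x, hx)
    return brentq(lambda y: kernel_cdf(y, scores_y, probs_y, hy) - p,
                  scores_y.min() - 5, scores_y.max() + 5)

scores = np.arange(11.0)
probs_x = np.full(11, 1 / 11)                               # toy form-X distribution
probs_y = np.array([1, 1, 2, 3, 4, 5, 4, 3, 2, 1, 1], float)
probs_y /= probs_y.sum()                                    # toy form-Y distribution
print(equate(5, scores, probs_x, scores, probs_y))          # form-X score 5 on the Y scale
```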
Peer reviewed | PDF on ERIC (full text available)
Metsämuuronen, Jari – International Journal of Educational Methodology, 2020
A new index of item discrimination power (IDP), dimension-corrected Somers' D (D2), is proposed. Somers' D is one of the superior alternatives to the item-total (Rit) and item-rest (Rir) correlations in reflecting the real IDP of items scored 0/1 and 0/1/2, that is, with up to three categories. D also reaches the extreme values +1 and -1 correctly…
Descriptors: Item Analysis, Correlation, Test Items, Simulation
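For readers unfamiliar with Somers' D as an index of item discrimination power, the sketch below computes it by brute-force pairwise comparison, treating the item as the independent variable and the total score as the dependent one. The direction convention and the toy data are assumptions rather than details from the article, and the proposed dimension correction (D2) is not shown.

```python
# Minimal sketch (not the article's code): Somers' D as an item discrimination index,
# computed over all examinee pairs; ties on the item are excluded from the denominator.
from itertools import combinations

def somers_d(item, total):
    """Somers' D of `total` given `item` (item = independent variable)."""
    concordant = discordant = tied_on_item = 0
    for (g1, x1), (g2, x2) in combinations(zip(item, total), 2):
        if g1 == g2:
            tied_on_item += 1          # pair carries no information about the item
            continue
        if (g1 - g2) * (x1 - x2) > 0:
            concordant += 1
        elif (g1 - g2) * (x1 - x2) < 0:
            discordant += 1
        # pairs tied only on the total score stay in the denominator
    n_pairs = len(item) * (len(item) - 1) // 2
    return (concordant - discordant) / (n_pairs - tied_on_item)

item  = [0, 0, 1, 1, 2, 2, 1, 0]        # item scored 0/1/2
total = [3, 5, 8, 9, 12, 11, 7, 4]      # total test scores
print(somers_d(item, total))            # +1 here: every informative pair is concordant
```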
Peer reviewed | Direct link
Heine, Jörg-Henrik; Robitzsch, Alexander – Large-scale Assessments in Education, 2022
Research Question: This paper examines the overarching question of the extent to which different analytic choices may influence inferences about country-specific cross-sectional and trend estimates in international large-scale assessments. We take data from the assessment of PISA mathematics proficiency from the four rounds between 2003 and 2012 as a…
Descriptors: Foreign Countries, International Assessment, Achievement Tests, Secondary School Students
Peer reviewed | PDF on ERIC (full text available)
Ganzfried, Sam; Yusuf, Farzana – Education Sciences, 2018
A problem faced by many instructors is that of designing exams that accurately assess the abilities of the students. Typically, these exams are prepared several days in advance, and generic question scores are used based on a rough approximation of question difficulty and length. For example, for a recent class taught by the author, there were…
Descriptors: Weighted Scores, Test Construction, Student Evaluation, Multiple Choice Tests
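As a toy illustration of the weighting problem described in the entry above, the sketch below rescales question weights in inverse proportion to observed proportion-correct so that harder questions carry more weight. This particular rule and the data are purely hypothetical and are not the approach developed in the article.

```python
# Purely illustrative weighting rule (not the article's method): weight each
# question in inverse proportion to its proportion-correct, then rescale so the
# weights sum to the total number of points on the exam.
import numpy as np

responses = np.array([            # rows = students, cols = questions (1 = correct)
    [1, 1, 0, 1],
    [1, 0, 0, 1],
    [1, 1, 1, 1],
    [0, 1, 0, 1],
])
p_correct = responses.mean(axis=0)            # rough difficulty proxy per question
raw = 1.0 / p_correct                         # harder question -> larger raw weight
weights = raw / raw.sum() * 100               # scale to a 100-point exam
scores = responses @ weights                  # weighted total per student
print(np.round(weights, 1), np.round(scores, 1))
```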
Peer reviewed | PDF on ERIC (full text available)
Lu, Ying – ETS Research Report Series, 2017
For standard- or criterion-based assessments, the use of cut scores to indicate mastery, nonmastery, or different levels of skill mastery is very common. As part of performance summary, it is of interest to examine the percentage of examinees at or above the cut scores (PAC) and how PAC evolves across administrations. This paper shows that…
Descriptors: Cutting Scores, Evaluation Methods, Mastery Learning, Performance Based Assessment
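The PAC statistic discussed in the entry above is simple to compute. The snippet below is a minimal sketch with invented administration data and a hypothetical cut score; it does not reproduce the report's analysis of how PAC behaves across administrations.

```python
# Minimal illustration (not the report's method): percentage of examinees at or
# above a cut score (PAC), tracked across administrations.
import numpy as np

def pac(scores, cut):
    """Percentage of examinees scoring at or above the cut score."""
    return 100.0 * np.mean(np.asarray(scores) >= cut)

administrations = {                 # invented score sets, one per administration
    "2015": [12, 18, 22, 25, 30, 14, 27],
    "2016": [15, 19, 23, 28, 31, 17, 26],
}
CUT = 20                            # hypothetical mastery cut score
for admin, scores in administrations.items():
    print(admin, f"PAC = {pac(scores, CUT):.1f}%")
```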
Peer reviewed | Direct link
Huggins-Manley, Anne Corinne – Educational and Psychological Measurement, 2017
This study defines subpopulation item parameter drift (SIPD) as a change in item parameters over time that is dependent on subpopulations of examinees, and hypothesizes that the presence of SIPD in anchor items is associated with bias and/or lack of invariance in three psychometric outcomes. Results show that SIPD in anchor items is associated…
Descriptors: Psychometrics, Test Items, Item Response Theory, Hypothesis Testing
Peer reviewed | Direct link
Undersander, Molly A.; Lund, Travis J.; Langdon, Laurie S.; Stains, Marilyne – Chemistry Education Research and Practice, 2017
The design of assessment tools is critical to accurately evaluate students' understanding of chemistry. Although extensive research has been conducted on various aspects of assessment tool design, few studies in chemistry have focused on the impact of the order in which questions are presented to students on the measurement of students'…
Descriptors: Test Construction, Scientific Concepts, Concept Formation, Science Education
Peer reviewed | Direct link
Wesolowski, Brian C. – International Journal of Music Education, 2017
The purpose of this study was to develop a valid and reliable rating scale to assess jazz rhythm sections in the context of jazz big band performance. The research questions that guided this study included: (a) what central factors contribute to the assessment of a jazz rhythm section? (b) what items should be used to describe and assess a jazz…
Descriptors: Test Construction, Rating Scales, Music, Evaluation Methods
Peer reviewed | Direct link
George, Ann Cathrice; Robitzsch, Alexander – Applied Measurement in Education, 2018
This article presents a new perspective on measuring gender differences in the large-scale assessment study Trends in International Mathematics and Science Study (TIMSS). The suggested empirical model is directly based on the theoretical competence model of the mathematics domain and thus includes the interaction between content and cognitive sub-competencies.…
Descriptors: Achievement Tests, Elementary Secondary Education, Mathematics Achievement, Mathematics Tests
Peer reviewed | Direct link
Todd, Amber; Romine, William L.; Cook Whitt, Katahdin – Science Education, 2017
We describe the development, validation, and use of the "Learning Progression-Based Assessment of Modern Genetics" (LPA-MG) in a high school biology context. Items were constructed based on a current learning progression framework for genetics (Shea & Duncan, 2013; Todd & Kenyon, 2015). The 34-item instrument, which was tied to…
Descriptors: Genetics, Science Instruction, High School Students, Evaluation Methods
Peer reviewed | Direct link
Finch, Holmes; Edwards, Julianne M. – Educational and Psychological Measurement, 2016
Standard approaches for estimating item response theory (IRT) model parameters generally work under the assumption that the latent trait being measured by a set of items follows the normal distribution. Estimation of IRT parameters in the presence of nonnormal latent traits has been shown to generate biased person and item parameter estimates. A…
Descriptors: Item Response Theory, Computation, Nonparametric Statistics, Bayesian Statistics
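The estimation issue studied in the entry above is easy to reproduce in simulation. The sketch below generates 2PL responses from a deliberately skewed (standardized gamma) latent trait distribution; the trait distribution, item parameters, and sample sizes are assumptions for illustration, and no estimator from the article is implemented.

```python
# Toy simulation (not the article's estimator): 2PL responses generated from a
# skewed latent trait, the setting in which normal-theory IRT estimation is biased.
import numpy as np

rng = np.random.default_rng(1)

def p_2pl(theta, a, b):
    """2PL probability of a correct response for each person-item pair."""
    return 1.0 / (1.0 + np.exp(-a * (theta[:, None] - b)))

n_persons, n_items = 2000, 20
theta = rng.gamma(shape=2.0, scale=1.0, size=n_persons)
theta = (theta - theta.mean()) / theta.std()          # mean 0, SD 1, but right-skewed
a = rng.uniform(0.8, 2.0, n_items)                    # discrimination parameters
b = rng.normal(0.0, 1.0, n_items)                     # difficulty parameters
responses = rng.binomial(1, p_2pl(theta, a, b))       # persons x items response matrix
print(responses.shape, responses.mean())
```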
Peer reviewed | Direct link
Socha, Alan; DeMars, Christine E.; Zilberberg, Anna; Phan, Ha – International Journal of Testing, 2015
The Mantel-Haenszel (MH) procedure is commonly used to detect items that function differentially for groups of examinees from various demographic and linguistic backgrounds--for example, in international assessments. As in some other DIF methods, the total score is used to match examinees on ability. In thin matching, each of the total score…
Descriptors: Test Items, Educational Testing, Evaluation Methods, Ability Grouping
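A compact sketch of the Mantel-Haenszel common odds ratio underlying the procedure in the entry above is given below. Examinees are matched on total score with one stratum per score point (thin matching); the group labels and toy data are invented, and the thick-versus-thin matching comparison made in the article is not reproduced.

```python
# Hedged sketch of the Mantel-Haenszel common odds ratio for DIF detection,
# matching examinees on total score with one stratum per score point.
from collections import defaultdict
from math import log

def mh_odds_ratio(total, group, item):
    """total: matching score; group: 'ref' or 'focal'; item: 0/1 response to the studied item."""
    strata = defaultdict(list)
    for s, g, u in zip(total, group, item):
        strata[s].append((g, u))
    num = den = 0.0
    for members in strata.values():
        A = sum(g == 'ref' and u == 1 for g, u in members)    # reference group, correct
        B = sum(g == 'ref' and u == 0 for g, u in members)    # reference group, incorrect
        C = sum(g == 'focal' and u == 1 for g, u in members)  # focal group, correct
        D = sum(g == 'focal' and u == 0 for g, u in members)  # focal group, incorrect
        T = len(members)
        num += A * D / T
        den += B * C / T
    return num / den

total = [10, 10, 10, 10, 12, 12, 12, 12]
group = ['ref', 'ref', 'focal', 'focal'] * 2
item  = [1, 0, 1, 0, 1, 1, 1, 0]
alpha = mh_odds_ratio(total, group, item)
print(alpha, -2.35 * log(alpha))    # MH D-DIF on the ETS delta scale; alpha near 1 suggests no DIF
```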
Peer reviewed | Direct link
Kim, Do-Hong; Lambert, Richard G.; Durham, Sean; Burts, Diane C. – Early Education and Development, 2018
Research Findings: This study builds on prior work related to the assessment of young dual language learners (DLLs). The purposes of the study were to (a) determine whether latent subgroups of preschool DLLs would replicate those found previously and (b) examine the validity of GOLD® by Teaching Strategies with empirically derived subgroups.…
Descriptors: Preschool Education, Teaching Methods, Bilingualism, Bilingual Education
Thummaphan, Phonraphee – ProQuest LLC, 2017
The present study aimed to present innovative assessments that support students' learning in STEM education, using an integrative framework for Cognitive Diagnostic Modeling (CDM). This framework is based on three components: cognition, observation, and interpretation (National Research Council, 2001). Specifically, this dissertation…
Descriptors: STEM Education, Cognitive Processes, Observation, Psychometrics
Peer reviewed | PDF on ERIC (full text available)
Golovachyova, Viktoriya N.; Menlibekova, Gulbakhyt Zh.; Abayeva, Nella F.; Ten, Tatyana L.; Kogaya, Galina D. – International Journal of Environmental and Science Education, 2016
Using computer-based monitoring systems that rely on tests could be the most effective way to evaluate knowledge. The problem of objective knowledge assessment by means of testing takes on a new dimension in the context of new paradigms in education. The analysis of the existing test methods enabled us to conclude that tests with selected…
Descriptors: Expertise, Computer Assisted Testing, Student Evaluation, Knowledge Level