Stefanie A. Wind; Yangmeng Xu – Educational Assessment, 2024
We explored three approaches to resolving or re-scoring constructed-response items in mixed-format assessments: rater agreement, person fit, and targeted double scoring (TDS). We used a simulation study to consider how the three approaches impact the psychometric properties of student achievement estimates, with an emphasis on person fit. We found…
Descriptors: Interrater Reliability, Error of Measurement, Evaluation Methods, Examiners
Atkinson, Cathy; Barrow, Joanna; Norris, Sarah – Educational Psychology in Practice, 2022
Assessment is one of the five functions of the educational psychologist's (EP's) role, yet there is a dearth of research exploring its distinctive contribution to school-based practice, and a lack of definition about what it is. In this study, the assessment practices of EPs were compared with those of other educational professionals who had…
Descriptors: Educational Psychology, School Psychologists, Evaluation Methods, Educational Testing
Sainan Xu; Jing Lu; Jiwei Zhang; Chun Wang; Gongjun Xu – Grantee Submission, 2024
With the growing attention on large-scale educational testing and assessment, the ability to process substantial volumes of response data becomes crucial. Current estimation methods within item response theory (IRT), despite their high precision, often pose considerable computational burdens with large-scale data, leading to reduced computational…
Descriptors: Educational Assessment, Bayesian Statistics, Statistical Inference, Item Response Theory
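The entry above describes Bayesian estimation for IRT only at a high level. As an illustrative contrast to the scalable estimators it studies, a minimal random-walk Metropolis sampler for a single Rasch ability with a standard normal prior can be sketched as follows; every name, the proposal scale, and the iteration count are assumptions for illustration, not the authors' method:

```python
import random
from math import exp, log

def rasch_loglik(theta, responses, b):
    """Rasch log-likelihood of a 0/1 response vector given ability theta
    and item difficulties b."""
    ll = 0.0
    for u, bj in zip(responses, b):
        p = 1 / (1 + exp(-(theta - bj)))
        ll += log(p) if u else log(1 - p)
    return ll

def posterior_theta(responses, b, n_iter=2000, seed=0):
    """Random-walk Metropolis sampler for one examinee's theta with a
    standard normal prior (illustrative only; not a scalable estimator)."""
    rng = random.Random(seed)
    theta, samples = 0.0, []
    cur = rasch_loglik(theta, responses, b) - theta**2 / 2
    for _ in range(n_iter):
        prop = theta + rng.gauss(0, 0.5)          # symmetric proposal
        new = rasch_loglik(prop, responses, b) - prop**2 / 2
        if log(rng.random()) < new - cur:          # Metropolis accept step
            theta, cur = prop, new
        samples.append(theta)
    return samples
```

Averaging the post-burn-in samples gives a posterior mean ability; with thousands of examinees and items, per-person samplers like this become the computational burden the article addresses.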
Socha, Alan; DeMars, Christine E.; Zilberberg, Anna; Phan, Ha – International Journal of Testing, 2015
The Mantel-Haenszel (MH) procedure is commonly used to detect items that function differentially for groups of examinees from various demographic and linguistic backgrounds--for example, in international assessments. As in some other DIF methods, the total score is used to match examinees on ability. In thin matching, each of the total score…
Descriptors: Test Items, Educational Testing, Evaluation Methods, Ability Grouping
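The Mantel-Haenszel statistic described above can be sketched in a few lines. The sketch below stratifies examinees by exact total score (thin matching, as the abstract describes) and computes the common odds ratio plus its delta-metric transform; the function name and toy data layout are assumptions for illustration, not the authors' implementation:

```python
from collections import defaultdict
from math import log

def mantel_haenszel_dif(item, total, group):
    """Mantel-Haenszel common odds ratio for one studied item.

    item  : 0/1 responses to the studied item
    total : matching scores (here, total test score; each score = one stratum)
    group : 'ref' / 'foc' label per examinee
    Returns (alpha_MH, MH_D_DIF) with MH D-DIF = -2.35 * ln(alpha_MH).
    """
    strata = defaultdict(lambda: [0, 0, 0, 0])  # [A, B, C, D] per score level
    for u, t, g in zip(item, total, group):
        cell = strata[t]
        if g == 'ref':
            cell[0 if u == 1 else 1] += 1   # A_k: ref correct, B_k: ref incorrect
        else:
            cell[2 if u == 1 else 3] += 1   # C_k: foc correct, D_k: foc incorrect
    num = den = 0.0
    for a, b, c, d in strata.values():
        n = a + b + c + d
        if n == 0:
            continue
        num += a * d / n
        den += b * c / n
    if den == 0:
        raise ValueError("no usable strata")
    alpha = num / den
    return alpha, -2.35 * log(alpha)
```

Thick matching would instead pool adjacent total scores into wider strata before the same accumulation.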
Zwick, Rebecca – ETS Research Report Series, 2012
Differential item functioning (DIF) analysis is a key component in the evaluation of the fairness and validity of educational tests. The goal of this project was to review the status of ETS DIF analysis procedures, focusing on three aspects: (a) the nature and stringency of the statistical rules used to flag items, (b) the minimum sample size…
Descriptors: Test Bias, Sample Size, Bayesian Statistics, Evaluation Methods
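The statistical flagging rules the report reviews are often summarized as a magnitude-plus-significance decision over the MH D-DIF statistic. A simplified paraphrase of the commonly published A/B/C rule follows; the operational ETS rules reviewed in the report may differ in detail, and the significance inputs are left abstract here:

```python
def ets_dif_category(mh_d_dif, significant_vs_0, significant_vs_1):
    """Simplified A/B/C DIF classification (paraphrase, not the official rule).

    mh_d_dif         : MH D-DIF estimate on the delta metric
    significant_vs_0 : is |MH D-DIF| significantly greater than 0?
    significant_vs_1 : is |MH D-DIF| significantly greater than 1?
    """
    a = abs(mh_d_dif)
    if a < 1.0 or not significant_vs_0:
        return 'A'   # negligible DIF
    if a >= 1.5 and significant_vs_1:
        return 'C'   # large DIF, item flagged for review
    return 'B'       # moderate DIF
```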
Emenogu, Barnabas C.; Falenchuk, Olesya; Childs, Ruth A. – Alberta Journal of Educational Research, 2010
Most implementations of the Mantel-Haenszel differential item functioning procedure delete records with missing responses or replace missing responses with scores of 0. These treatments of missing data make strong assumptions about the causes of the missing data. Such assumptions may be particularly problematic when groups differ in their patterns…
Descriptors: Foreign Countries, Test Bias, Test Items, Educational Testing
Reckase, Mark D.; Xu, Jing-Ru – Educational and Psychological Measurement, 2015
How to compute and report subscores for a test that was originally designed for reporting scores on a unidimensional scale has been a topic of interest in recent years. In the research reported here, we describe an application of multidimensional item response theory to identify a subscore structure in a test designed for reporting results using a…
Descriptors: English, Language Skills, English Language Learners, Scores
Marshall, Jeffery H.; Chinna, Ung; Hok, Ung Ngo; Tinon, Souer; Veasna, Meung; Nissay, Put – Educational Assessment, Evaluation and Accountability, 2012
The global spread of national assessment testing activities, and the growing pressure to move beyond basic measures of participation in educational monitoring, means that student achievement measures are likely to become increasingly relevant indicators of systemic progress in the developing world. Using data from the CESSP project in Cambodia,…
Descriptors: Foreign Countries, Academic Achievement, Developing Nations, Evaluation Methods
Lee, Won-Chan; Ban, Jae-Chun – Applied Measurement in Education, 2010
Various applications of item response theory often require linking to achieve a common scale for item parameter estimates obtained from different groups. This article used a simulation to examine the relative performance of four different item response theory (IRT) linking procedures in a random groups equating design: concurrent calibration with…
Descriptors: Item Response Theory, Simulation, Comparative Analysis, Measurement Techniques
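Of the classic IRT linking procedures, the mean/sigma method is the simplest to sketch. It is shown here purely to illustrate what a scale transformation does, and is not necessarily one of the four procedures the article compares:

```python
from statistics import mean, pstdev

def mean_sigma_link(b_new, b_old):
    """Mean/sigma linking constants placing the new form onto the old scale.

    b_new, b_old : difficulty estimates for the same (common) items on each
                   scale, in the same item order.
    Returns (A, B) such that b_old ~= A * b_new + B, and a_old ~= a_new / A.
    """
    A = pstdev(b_old) / pstdev(b_new)      # ratio of difficulty spreads
    B = mean(b_old) - A * mean(b_new)      # shift after rescaling
    return A, B
```

Concurrent calibration, by contrast, avoids computing (A, B) at all by estimating both groups' parameters in a single run.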
Ventouras, Errikos; Triantis, Dimos; Tsiakas, Panagiotis; Stergiopoulos, Charalampos – Computers & Education, 2010
The aim of the present research was to compare the use of multiple-choice questions (MCQs) as an examination method to examination based on constructed-response questions (CRQs). Although MCQs have an advantage concerning objectivity in the grading process and speed in production of results, they also introduce an error in the final…
Descriptors: Computer Assisted Instruction, Scoring, Grading, Comparative Analysis
Armstrong, Ronald D.; Shi, Min – Journal of Educational Measurement, 2009
This article demonstrates the use of a new class of model-free cumulative sum (CUSUM) statistics to detect person fit given the responses to a linear test. The fundamental statistic being accumulated is the likelihood ratio of two probabilities. The detection performance of this CUSUM scheme is compared to other model-free person-fit statistics…
Descriptors: Probability, Simulation, Models, Psychometrics
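A generic one-sided CUSUM of log-likelihood ratios, in the spirit of the statistic described above, can be sketched as follows. The choice of alternative probability, the reset-at-zero rule, and the threshold are illustrative assumptions, not the authors' specification:

```python
from math import log

def cusum_flags(responses, p_model, shift=0.2, threshold=2.0):
    """One-sided CUSUM on per-item log-likelihood ratios for one examinee.

    responses : 0/1 item scores in administration order
    p_model   : model-implied probabilities of a correct response
    shift     : hypothesized inflation of success probability under misfit
    Accumulates the log likelihood ratio of the shifted vs. model
    probabilities, resetting at zero; returns the running statistic and
    whether it ever crossed the threshold.
    """
    c, path = 0.0, []
    for u, p in zip(responses, p_model):
        p1 = min(p + shift, 0.99)  # alternative (aberrant) probability
        lr = log(p1 / p) if u == 1 else log((1 - p1) / (1 - p))
        c = max(0.0, c + lr)       # one-sided CUSUM reset
        path.append(c)
    return path, max(path) > threshold
```

An examinee who keeps answering very hard items correctly accumulates positive log ratios and trips the flag; in-model response patterns keep resetting near zero.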
Fan, Ya-Ching; Wang, Tzu-Hua; Wang, Kuo-Hua – Computers & Education, 2011
This research investigates the effect of a web-based model, named "Practicing, Reflecting, and Revising with Web-based Assessment and Test Analysis system (P2R-WATA) Assessment Literacy Development Model," on enhancing assessment knowledge and perspectives of secondary in-service teachers, and adopts a single group experimental research…
Descriptors: Research Design, Test Items, Summer Programs, Prior Learning
Bryant, Darren A.; Carless, David R. – Educational Research for Policy and Practice, 2010
The literature suggests that peer assessment contributes to the development of student learning and promotes ownership of assessment processes. These claims emerge from research conducted primarily in Western contexts. This exploratory paper reports on the perspectives that a class of Hong Kong primary school students and their teachers have on…
Descriptors: Feedback (Response), Peer Evaluation, Foreign Countries, Language Proficiency
Johnson, Jeff, Ed. – Educational Testing Service, 2009
In four articles adapted from the Educational Testing Service (ETS) Research Report Series, Issue 2 of ETS Research Spotlight provides a small taste of the range of assessment-related research capabilities of the ETS Research and Development Division. Those articles cover assessment-related research aimed at developing models of student learning,…
Descriptors: Basic Writing, Educational Testing, Research Reports, Measures (Individuals)
Holling, Heinz; Bertling, Jonas P.; Zeuch, Nina – Studies in Educational Evaluation, 2009
Mathematical word problems represent a common item format for assessing student competencies. Automatic item generation (AIG) is an effective way of constructing many items with predictable difficulties, based on a set of predefined task parameters. The current study presents a framework for the automatic generation of probability word problems…
Descriptors: Word Problems (Mathematics), Probability, Automation, College Students
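Template-based automatic item generation of the kind described can be sketched minimally: a stem template whose numeric slots are the task parameters, plus a derived answer key. The template, parameter ranges, and function names here are invented for illustration and are not the authors' framework:

```python
import random

TEMPLATE = ("A bag contains {r} red and {b} blue marbles. "
            "One marble is drawn at random. "
            "What is the probability that it is red?")

def generate_item(rng):
    """Generate one probability word problem and its key from the template.

    The marble counts are the task parameters; difficulty could be steered
    by constraining them (e.g., whether the answer fraction reduces).
    """
    r, b = rng.randint(2, 9), rng.randint(2, 9)
    stem = TEMPLATE.format(r=r, b=b)
    key = f"{r}/{r + b}"
    return stem, key

rng = random.Random(7)          # fixed seed for reproducible item banks
items = [generate_item(rng) for _ in range(3)]
```

Predictable difficulty is the point of AIG: because every generated item shares the same structure, its parameters, not hand-authoring, drive the psychometric properties.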