ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	29

Descriptor

Psychometrics	91
Testing Programs	91
Test Construction	33
Elementary Secondary Education	28
State Programs	26
Educational Assessment	25
Test Validity	21
Test Use	19
Scoring	17
Student Evaluation	17
Test Items	17
Achievement Tests	16
Standardized Tests	16
Educational Testing	15
Item Response Theory	15
Evaluation Methods	14
Test Reliability	14
Academic Achievement	13
Mathematics Tests	12
Validity	12
Computer Assisted Testing	11
Statistical Analysis	11
Testing Problems	11
Evaluation Research	9
Measurement Techniques	9
More ▼

Education Level

Elementary Secondary Education	14
Higher Education	4
Elementary Education	3
Grade 6	3
Grade 8	3
Grade 3	2
Grade 4	2
Grade 5	2
Grade 7	2
Middle Schools	2
Adult Education	1
Early Childhood Education	1
Grade 10	1
Grade 2	1
Grade 9	1
High Schools	1
Postsecondary Education	1
More ▼

Audience

Researchers	6
Practitioners	3
Students	2
Teachers	2
Parents	1

Location

Kentucky	2
Massachusetts	2
United States	2
Connecticut	1
Dominica	1
Georgia	1
Grenada	1
Hawaii	1
Indiana	1
Malawi	1
Michigan	1
New York	1
Oregon	1
Pennsylvania	1
Saint Lucia	1
Saint Vincent and the…	1
Singapore	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	4
Debra P v Turlington	1
Individuals with Disabilities…	1

Assessments and Surveys

SAT (College Admission Test)	3
Massachusetts Comprehensive…	2
National Assessment of…	2
Program for International…	2
California Achievement Tests	1
Delaware Student Testing…	1
Early Childhood Longitudinal…	1
General Aptitude Test Battery	1
Graduate Record Examinations	1
Metropolitan Achievement Tests	1
National Longitudinal Study…	1
North Carolina End of Course…	1
Progress in International…	1
Stanford Achievement Tests	1
System of Multicultural…	1
Texas Assessment of Academic…	1
Trends in International…	1
Watson Glaser Critical…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 91 results Save | Export

Using Rasch Measurement to Score, Evaluate, and Improve Examinations in an Anatomy Course

Peer reviewed

Direct link

Royal, Kenneth D.; Gilliland, Kurt O.; Kernick, Edward T. – Anatomical Sciences Education, 2014

Any examination that involves moderate to high stakes implications for examinees should be psychometrically sound and legally defensible. Currently, there are two broad and competing families of test theories that are used to score examination data. The majority of instructors outside the high-stakes testing arena rely on classical test theory…

Descriptors: Item Response Theory, Scoring, Evaluation Methods, Anatomy

A Review of International Large-Scale Assessments in Education: Assessing Component Skills and Collecting Contextual Data. PISA for Development

Direct link

Cresswell, John; Schwantner, Ursula; Waters, Charlotte – OECD Publishing, 2015

This report reviews the major international and regional large-scale educational assessments, including international surveys, school-based surveys and household-based surveys. The report compares and contrasts the cognitive and contextual data collection instruments and implementation methods used by the different assessments in order to identify…

Descriptors: International Assessment, Educational Assessment, Data Collection, Comparative Analysis

Accumulative Equating Error after a Chain of Linear Equatings

Peer reviewed

Direct link

Guo, Hongwen – Psychometrika, 2010

After many equatings have been conducted in a testing program, equating errors can accumulate to a degree that is not negligible compared to the standard error of measurement. In this paper, the author investigates the asymptotic accumulative standard error of equating (ASEE) for linear equating methods, including chained linear, Tucker, and…

Descriptors: Testing Programs, Testing, Error of Measurement, Equated Scores

The Potential Impact of Not Being Able to Create Parallel Tests on Expected Classification Accuracy

Peer reviewed

Direct link

Wyse, Adam E. – Applied Psychological Measurement, 2011

In many practical testing situations, alternate test forms from the same testing program are not strictly parallel to each other and instead the test forms exhibit small psychometric differences. This article investigates the potential practical impact that these small psychometric differences can have on expected classification accuracy. Ten…

Descriptors: Test Format, Test Construction, Testing Programs, Psychometrics

First Language of Test Takers and Fairness Assessment Procedures

Peer reviewed

Direct link

Sinharay, Sandip; Dorans, Neil J.; Liang, Longjuan – Educational Measurement: Issues and Practice, 2011

Over the past few decades, those who take tests in the United States have exhibited increasing diversity with respect to native language. Standard psychometric procedures for ensuring item and test fairness that have existed for some time were developed when test-taking groups were predominantly native English speakers. A better understanding of…

Descriptors: Test Bias, Testing Programs, Psychometrics, Language Proficiency

Science Assessments for All: Integrating Science Simulations into Balanced State Science Assessment Systems

Peer reviewed

Direct link

Quellmalz, Edys S.; Timms, Michael J.; Silberglitt, Matt D.; Buckley, Barbara C. – Journal of Research in Science Teaching, 2012

This article reports on the collaboration of six states to study how simulation-based science assessments can become transformative components of multi-level, balanced state science assessment systems. The project studied the psychometric quality, feasibility, and utility of simulation-based science assessments designed to serve formative purposes…

Descriptors: State Programs, Educational Assessment, Simulated Environment, Grade 6

A Comparison of Approaches for Improving the Reliability of Objective Level Scores

Peer reviewed

Direct link

Skorupski, William P.; Carvajal, Jorge – Educational and Psychological Measurement, 2010

This study is an evaluation of the psychometric issues associated with estimating objective level scores, often referred to as "subscores." The article begins by introducing the concepts of reliability and validity for subscores from statewide achievement tests. These issues are discussed with reference to popular scaling techniques, classical…

Descriptors: Testing Programs, Test Validity, Achievement Tests, Scores

Design of a Computer-Adaptive Test to Measure English Literacy and Numeracy in the Singapore Workforce: Considerations, Benefits, and Implications

Peer reviewed

Direct link

Jacobsen, Jared; Ackermann, Richard; Eguez, Jane; Ganguli, Debalina; Rickard, Patricia; Taylor, Linda – Journal of Applied Testing Technology, 2011

A computer adaptive test (CAT) is a delivery methodology that serves the larger goals of the assessment system in which it is embedded. A thorough analysis of the assessment system for which a CAT is being designed is critical to ensure that the delivery platform is appropriate and addresses all relevant complexities. As such, a CAT engine must be…

Descriptors: Delivery Systems, Testing Programs, Computer Assisted Testing, Foreign Countries

Mixture Item Response Theory-MIMIC Model: Simultaneous Estimation of Differential Item Functioning for Manifest Groups and Latent Classes

Direct link

Bilir, Mustafa Kuzey – ProQuest LLC, 2009

This study uses a new psychometric model (mixture item response theory-MIMIC model) that simultaneously estimates differential item functioning (DIF) across manifest groups and latent classes. Current DIF detection methods investigate DIF from only one side, either across manifest groups (e.g., gender, ethnicity, etc.), or across latent classes…

Descriptors: Test Items, Testing Programs, Markov Processes, Psychometrics

The Determinants of the Post-Adoption Satisfaction of Educators with an E-Learning System

Peer reviewed

Direct link

Islam, A. K. M. Najmul – Journal of Information Systems Education, 2011

This paper examines factors that influence the post-adoption satisfaction of educators with e-learning systems. Based on the expectation-confirmation framework, we propose a research model that demonstrates how post-adoption beliefs affect post-adoption satisfaction. The model was tested at a university by educators (n = 175) who use an e-learning…

Descriptors: Electronic Learning, Testing Programs, Participant Satisfaction, Teacher Attitudes

Extended Time Testing Accommodations for Students with Disabilities: Answers to Five Fundamental Questions

Peer reviewed

Direct link

Lovett, Benjamin J. – Review of Educational Research, 2010

Extended time is one of the most common testing accommodations provided to students with disabilities. It is also controversial; critics of extended time accommodations argue that extended time is used too readily, without concern for how it changes the skills measured by tests, leading to scores that cannot be compared fairly with those of other…

Descriptors: Testing Accommodations, Academic Accommodations (Disabilities), Literature Reviews, Meta Analysis

Modeling Change in Large-Scale Longitudinal Studies of Educational Growth: Four Decades of Contributions to the Assessment of Educational Growth. Research Report. ETS RR-12-04. ETS R&D Scientific and Policy Contributions Series. ETS SPC-12-01

Peer reviewed
PDF on ERIC

Download full text

Rock, Donald A. – ETS Research Report Series, 2012

This paper provides a history of ETS's role in developing assessment instruments and psychometric procedures for measuring change in large-scale national assessments funded by the Longitudinal Studies branch of the National Center for Education Statistics. It documents the innovations developed during more than 30 years of working with…

Descriptors: Models, Educational Change, Longitudinal Studies, Educational Development

Item Position and Item Difficulty Change in an IRT-Based Common Item Equating Design

Peer reviewed

Direct link

Meyers, Jason L.; Miller, G. Edward; Way, Walter D. – Applied Measurement in Education, 2009

In operational testing programs using item response theory (IRT), item parameter invariance is threatened when an item appears in a different location on the live test than it did when it was field tested. This study utilizes data from a large state's assessments to model change in Rasch item difficulty (RID) as a function of item position change,…

Descriptors: Test Items, Test Content, Testing Programs, Simulation

Detecting and Correcting Scale Drift in Test Equating: An Illustration from a Large Scale Testing Program

Peer reviewed

Direct link

Puhan, Gautam – Applied Measurement in Education, 2009

The purpose of this study is to determine the extent of scale drift on a test that employs cut scores. It was essential to examine scale drift for this testing program because new forms in this testing program are often put on scale through a series of intermediate equatings (known as equating chains). This process may cause equating error to…

Descriptors: Testing Programs, Testing, Measurement Techniques, Item Response Theory

Differential Item Functioning Analysis Using Rasch Item Information Functions

Peer reviewed

Direct link

Wyse, Adam E.; Mapuranga, Raymond – International Journal of Testing, 2009

Differential item functioning (DIF) analysis is a statistical technique used for ensuring the equity and fairness of educational assessments. This study formulates a new DIF analysis method using the information similarity index (ISI). ISI compares item information functions when data fits the Rasch model. Through simulations and an international…

Descriptors: Test Bias, Evaluation Methods, Test Items, Educational Assessment

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7

Educational Measurement:…	5
Journal of Applied Testing…	5
Educational and Psychological…	4
Applied Measurement in…	3
Applied Psychological…	3
Educational Testing Service	3
Behavioral Research and…	2
Education Policy Analysis…	2
International Journal of…	2
ProQuest LLC	2
APA Books	1
Anatomical Sciences Education	1
ETS Research Report Series	1
Education and Urban Society	1
Florida Journal of…	1
Journal of Educational…	1
Journal of Educational…	1
Journal of Information…	1
Journal of Research in…	1
Learning Disabilities: A…	1
OECD Publishing	1
Personnel and Guidance Journal	1
Practical Assessment,…	1
Psychometrika	1
Regional Educational…	1
More ▼

Dorans, Neil J.	3
Thurlow, Martha	3
Alonzo, Julie	2
Anderson, Daniel	2
Bielinski, John	2
Blai, Boris, Jr.	2
Dings, Jonathan	2
Huynh, Huynh	2
Jamgochian, Elisa	2
Kamata, Akihito	2
Lai, Cheng-Fei	2
LeMahieu, Paul G.	2
Mehrens, William A.	2
Minnema, Jane	2
Nese, Joseph F. T.	2
Rock, Donald A.	2
Saez, Leilani	2
Tindal, Gerald	2
Wyse, Adam E.	2
Ackermann, Richard	1
Ananda, Sri	1
Angoff, William H., Ed.	1
Baggett, W. Jay	1
Bennett, Randy Elliot	1
More ▼

Journal Articles	37
Reports - Research	25
Reports - Evaluative	24
Reports - Descriptive	18
Speeches/Meeting Papers	12
Numerical/Quantitative Data	7
Opinion Papers	7
Guides - Non-Classroom	3
Book/Product Reviews	2
Dissertations/Theses -…	2
Information Analyses	2
Tests/Questionnaires	2
Books	1
Collected Works - General	1
Computer Programs	1
Guides - Classroom - Learner	1
Guides - Classroom - Teacher	1
Guides - General	1
Reference Materials -…	1
More ▼