ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	4
Since 2006 (last 20 years)	14

Descriptor

Psychometrics	18
Scores	18
Test Construction	6
Item Response Theory	5
Test Items	5
Test Theory	4
Tests	4
Achievement Tests	3
Educational Assessment	3
Educational Testing	3
Measurement	3
Test Reliability	3
Test Use	3
Test Validity	3
Testing	3
Error of Measurement	2
Evaluation	2
Foreign Countries	2
International Assessment	2
Multiple Choice Tests	2
Reliability	2
Reports	2
Secondary School Students	2
Teaching Methods	2
Test Bias	2
More ▼

Source

Educational Measurement:…

Publication Type

Journal Articles	18
Reports - Descriptive	6
Reports - Research	6
Reports - Evaluative	5
Opinion Papers	1

Education Level

Secondary Education

Audience

Location

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

Program for International…	2
National Assessment of…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 18 results Save | Export

Evaluating Item Fit Statistic Thresholds in PISA: Analysis of Cross-Country Comparability of Cognitive Items

Peer reviewed

Direct link

Joo, Seang-Hwane; Khorramdel, Lale; Yamamoto, Kentaro; Shin, Hyo Jeong; Robin, Frederic – Educational Measurement: Issues and Practice, 2021

In Programme for International Student Assessment (PISA), item response theory (IRT) scaling is used to examine the psychometric properties of items and scales and to provide comparable test scores across participating countries and over time. To balance the comparability of IRT item parameter estimations across countries with the best possible…

Descriptors: Foreign Countries, International Assessment, Achievement Tests, Secondary School Students

Affordances of Item Formats and Their Effects on Test-Taker Cognition under Uncertainty

Peer reviewed

Direct link

Moon, Jung Aa; Keehner, Madeleine; Katz, Irvin R. – Educational Measurement: Issues and Practice, 2019

The current study investigated how item formats and their inherent affordances influence test-takers' cognition under uncertainty. Adult participants solved content-equivalent math items in multiple-selection multiple-choice and four alternative grid formats. The results indicated that participants' affirmative response tendency (i.e., judge the…

Descriptors: Affordances, Test Items, Test Format, Test Wiseness

How Robust Are Cross-Country Comparisons of PISA Scores to the Scaling Model Used?

Peer reviewed

Direct link

Jerrim, John; Parker, Philip; Choi, Alvaro; Chmielewski, Anna Katyn; Sälzer, Christine; Shure, Nikki – Educational Measurement: Issues and Practice, 2018

The Programme for International Student Assessment (PISA) is an important international study of 15-olds' knowledge and skills. New results are released every 3 years, and have a substantial impact upon education policy. Yet, despite its influence, the methodology underpinning PISA has received significant criticism. Much of this criticism has…

Descriptors: Educational Assessment, Comparative Education, Achievement Tests, Foreign Countries

Using Evidence-Centered Design to Create a Special Educator Observation System

Peer reviewed

Direct link

Johnson, Evelyn S.; Crawford, Angela; Moylan, Laura A.; Zheng, Yuzhu – Educational Measurement: Issues and Practice, 2018

The evidence-centered design framework was used to create a special education teacher observation system, Recognizing Effective Special Education Teachers. Extensive reviews of research informed the domain analysis and modeling stages, and led to the conceptual framework in which effective special education teaching is operationalized as the…

Descriptors: Evidence Based Practice, Special Education Teachers, Observation, Disabilities

Developing Test Score Reports that Work: The Process and Best Practices for Effective Communication

Peer reviewed

Direct link

Zenisky, April L.; Hambleton, Ronald K. – Educational Measurement: Issues and Practice, 2012

Test scores matter these days. Test-takers want to understand how they performed, and test score reports, particularly those for individual examinees, are the vehicles by which most people get the bulk of this information. Historically, score reports have not always met the examinees' information or usability needs, but this is clearly changing…

Descriptors: Scores, Psychometrics, Test Results, Usability

An NCME Instructional Module on Subscores

Peer reviewed

Direct link

Sinharay, Sandip; Puhan, Gautam; Haberman, Shelby J. – Educational Measurement: Issues and Practice, 2011

The purpose of this ITEMS module is to provide an introduction to subscores. First, examples of subscores from an operational test are provided. Then, a review of methods that can be used to examine if subscores have adequate psychometric quality is provided. It is demonstrated, using results from operational and simulated data, that subscores…

Descriptors: Scores, Psychometrics, Tests, Data

Comments on Neil Dorans's NCME Career Award Address: The Contestant Perspective on Taking Tests--Emanations from the Statue within

Peer reviewed

Direct link

Mislevy, Robert J. – Educational Measurement: Issues and Practice, 2012

This article presents the author's observations on Neil Dorans's NCME Career Award Address: "The Contestant Perspective on Taking Tests: Emanations from the Statue within." He calls attention to some points that Dr. Dorans made in his address, and offers his thoughts in response.

Descriptors: Testing, Test Reliability, Psychometrics, Scores

Psychometric Properties of IRT Proficiency Estimates

Peer reviewed

Direct link

Kolen, Michael J.; Tong, Ye – Educational Measurement: Issues and Practice, 2010

Psychometric properties of item response theory proficiency estimates are considered in this paper. Proficiency estimators based on summed scores and pattern scores include non-Bayes maximum likelihood and test characteristic curve estimators and Bayesian estimators. The psychometric properties investigated include reliability, conditional…

Descriptors: Test Length, Psychometrics, Item Response Theory, Scores

Psychometric Properties of Raw and Scale Scores on Mixed-Format Tests

Peer reviewed

Direct link

Kolen, Michael J.; Lee, Won-Chan – Educational Measurement: Issues and Practice, 2011

This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…

Descriptors: Test Use, Test Format, Error of Measurement, Raw Scores

Developing Score Reports for Cognitive Diagnostic Assessments

Peer reviewed

Direct link

Roberts, Mary Roduta; Gierl, Mark J. – Educational Measurement: Issues and Practice, 2010

This paper presents a framework to provide a structured approach for developing score reports for cognitive diagnostic assessments ("CDAs"). Guidelines for reporting and presenting diagnostic scores are based on a review of current educational test score reporting practices and literature from the area of information design. A sample diagnostic…

Descriptors: Diagnostic Tests, Scores, Technical Writing, Cognitive Tests

Measurement Invariance in Confirmatory Factor Analysis: An Illustration Using IQ Test Performance of Minorities

Peer reviewed

Direct link

Wicherts, Jelte M.; Dolan, Conor V. – Educational Measurement: Issues and Practice, 2010

Measurement invariance with respect to groups is an essential aspect of the fair use of scores of intelligence tests and other psychological measurements. It is widely believed that equal factor loadings are sufficient to establish measurement invariance in confirmatory factor analysis. Here, it is shown why establishing measurement invariance…

Descriptors: Factor Structure, Intelligence Tests, Intelligence Quotient, Factor Analysis

Consequences of Test Score Use as Validity Evidence: Roles and Responsibilities

Peer reviewed

Direct link

Nichols, Paul D.; Williams, Natasha – Educational Measurement: Issues and Practice, 2009

This article has three goals. The first goal is to clarify the role that the consequences of test score use play in validity judgments by reviewing the role that modern writers on validity have ascribed for consequences in supporting validity judgments. The second goal is to summarize current views on who is responsible for collecting evidence of…

Descriptors: Tests, Test Validity, Scores, Data Collection

Instructional Sensitivity as a Psychometric Property of Assessments

Peer reviewed

Direct link

Polikoff, Morgan S. – Educational Measurement: Issues and Practice, 2010

Standards-based reform, as codified by the No Child Left Behind Act, relies on the ability of assessments to accurately reflect the learning that takes place in U.S. classrooms. However, this property of assessments--their instructional sensitivity--is rarely, if ever, investigated by test developers, states, or researchers. In this paper, the…

Descriptors: Federal Legislation, Psychometrics, Accountability, Teaching Methods

Three Options Are Optimal for Multiple-Choice Items: A Meta-Analysis of 80 Years of Research

Peer reviewed

Direct link

Rodriguez, Michael C. – Educational Measurement: Issues and Practice, 2005

Multiple-choice items are a mainstay of achievement testing. The need to adequately cover the content domain to certify achievement proficiency by producing meaningful precise scores requires many high-quality items. More 3-option items can be administered than 4- or 5-option items per testing time while improving content coverage, without…

Descriptors: Psychometrics, Testing, Scores, Test Construction

A Perspective on the History of Generalizability Theory.

Peer reviewed

Brennan, Robert L. – Educational Measurement: Issues and Practice, 1997

The history of generalizability theory (G theory) is told from the perspective of one researcher's experiences, describing psychometric and scientific perspectives that influenced the development of G theory and its adoption. Work that remains to be done in the field is outlined. (SLD)

Descriptors: Educational Testing, Generalizability Theory, Measurement, Psychometrics

Previous Page | Next Page »

Pages: 1 | 2

Kolen, Michael J.	2
Bock, R. Darrell	1
Brennan, Robert L.	1
Chmielewski, Anna Katyn	1
Choi, Alvaro	1
Cizek, Gregory J.	1
Crawford, Angela	1
Crocker, Linda	1
Dolan, Conor V.	1
Frisbie, David A.	1
Gierl, Mark J.	1
Haberman, Shelby J.	1
Hambleton, Ronald K.	1
Jerrim, John	1
Johnson, Evelyn S.	1
Joo, Seang-Hwane	1
Katz, Irvin R.	1
Keehner, Madeleine	1
Khorramdel, Lale	1
Lee, Won-Chan	1
Mehrens, William A.	1
Mislevy, Robert J.	1
Moon, Jung Aa	1
Moylan, Laura A.	1
Nichols, Paul D.	1
More ▼