Showing 1 to 15 of 46 results
Peer reviewed
Wan, Siyu; Keller, Lisa A. – Practical Assessment, Research & Evaluation, 2023
Statistical process control (SPC) charts have been widely used in the field of educational measurement. The cumulative sum (CUSUM) is an established SPC method to detect aberrant responses for educational assessments. There are many studies that investigated the performance of CUSUM in different test settings. This paper describes the CUSUM…
Descriptors: Visual Aids, Educational Assessment, Evaluation Methods, Item Response Theory
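The CUSUM procedure named in this abstract accumulates standardized person-fit residuals and flags an examinee when either cumulative path crosses a control limit. A minimal sketch follows; the reference value `k`, decision limit `h`, and the assumption that standardized residuals are already available are illustrative choices, not values from the cited study.

```python
# Hedged sketch of a two-sided CUSUM for flagging aberrant response patterns.
# Inputs are assumed to be standardized person-fit residuals per item.

def cusum(residuals, k=0.5):
    """Return the upper and lower CUSUM paths over a residual sequence."""
    upper, lower = [0.0], [0.0]
    for r in residuals:
        upper.append(max(0.0, upper[-1] + r - k))  # drift above expectation
        lower.append(min(0.0, lower[-1] + r + k))  # drift below expectation
    return upper[1:], lower[1:]

def flagged_positions(residuals, k=0.5, h=4.0):
    """Indices where either CUSUM path exceeds the control limit h."""
    up, lo = cusum(residuals, k)
    return [i for i, (u, l) in enumerate(zip(up, lo)) if u > h or -l > h]
```

With in-control residuals the paths hover near zero; a sustained shift (e.g., a run of unexpectedly correct responses) accumulates until the limit is crossed.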
Peer reviewed
Lu, Ru; Kim, Sooyeon – ETS Research Report Series, 2021
This study evaluated the impact of subgroup weighting for equating through a common-item anchor. We used data from a single test form to create two research forms for which the equating relationship was known. The results showed that equating was most accurate when the new form and reference form samples were weighted to be similar to the target…
Descriptors: Equated Scores, Weighted Scores, Raw Scores, Test Items
Peer reviewed
Guo, Hongwen; Dorans, Neil J. – Journal of Educational Measurement, 2020
We make a distinction between the operational practice of using an observed score to assess differential item functioning (DIF) and the concept of departure from measurement invariance (DMI) that conditions on a latent variable. DMI and DIF indices of effect sizes, based on the Mantel-Haenszel test of common odds ratio, converge under restricted…
Descriptors: Weighted Scores, Test Items, Item Response Theory, Measurement
Peer reviewed
Lu, Ru; Guo, Hongwen; Dorans, Neil J. – ETS Research Report Series, 2021
Two families of analysis methods can be used for differential item functioning (DIF) analysis. One family is DIF analysis based on observed scores, such as the Mantel-Haenszel (MH) and the standardized proportion-correct metric for DIF procedures; the other is analysis based on latent ability, in which the statistic is a measure of departure from…
Descriptors: Robustness (Statistics), Weighted Scores, Test Items, Item Analysis
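The Mantel-Haenszel common odds ratio underlying the observed-score DIF family mentioned here has a standard closed form over score strata. The sketch below assumes one 2x2 table per matched-score stratum (reference/focal by correct/incorrect); the table layout is an illustrative convention, not taken from the paper.

```python
# Hedged sketch of the Mantel-Haenszel common odds ratio across score strata.
# Each stratum is a tuple (a, b, c, d):
#   a = reference-group correct,  b = reference-group incorrect,
#   c = focal-group correct,      d = focal-group incorrect.

def mh_odds_ratio(strata):
    """Mantel-Haenszel estimate: sum(a*d/T) / sum(b*c/T) over strata."""
    num = sum(a * d / (a + b + c + d) for a, b, c, d in strata)
    den = sum(b * c / (a + b + c + d) for a, b, c, d in strata)
    return num / den
```

A ratio near 1 indicates no DIF after matching on the observed score; values far from 1 favor one group within matched strata.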
Peer reviewed
Ganzfried, Sam; Yusuf, Farzana – Education Sciences, 2018
A problem faced by many instructors is that of designing exams that accurately assess the abilities of the students. Typically, these exams are prepared several days in advance, and generic question scores are used based on rough approximation of the question difficulty and length. For example, for a recent class taught by the author, there were…
Descriptors: Weighted Scores, Test Construction, Student Evaluation, Multiple Choice Tests
Peer reviewed
Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2015
An equating procedure for a testing program with evolving distribution of examinee profiles is developed. No anchor is available because the original scoring scheme was based on expert judgment of the item difficulties. Pairs of examinees from two administrations are formed by matching on coarsened propensity scores derived from a set of…
Descriptors: Equated Scores, Testing Programs, College Entrance Examinations, Scoring
Peer reviewed
Carlson, James E. – ETS Research Report Series, 2014
A little-known theorem, a generalization of Pythagoras's theorem, due to Pappus, is used to present a geometric explanation of various definitions of the contribution of component tests to their composite. I show that an unambiguous definition of the unique contribution of a component to the composite score variance is present if and only if the…
Descriptors: Geometric Concepts, Scores, Validity, Reliability
Köhler, Hannah, Ed.; Weber, Sabine, Ed.; Brese, Falk, Ed.; Schulz, Wolfram, Ed.; Carstens, Ralph, Ed. – International Association for the Evaluation of Educational Achievement, 2018
The IEA's International Civic and Citizenship Education Study (ICCS) investigates the ways in which young people are prepared to undertake their roles as citizens in a range of countries in the second decade of the 21st century. ICCS 2016 is the second cycle of a study initiated in 2009. The ICCS 2016 user guide describes the content and format of…
Descriptors: Guides, Citizenship Education, Citizen Participation, Citizenship Responsibility
Wagemaker, Hans, Ed. – International Association for the Evaluation of Educational Achievement, 2020
Although international large-scale assessment (ILSA) of education, pioneered by the International Association for the Evaluation of Educational Achievement, is now a well-established science, non-practitioners and many users often substantially misunderstand how large-scale assessments are conducted, what questions and challenges they are designed to…
Descriptors: International Assessment, Achievement Tests, Educational Assessment, Comparative Analysis
Schulz, Wolfram, Ed.; Losito, Bruno, Ed.; Carstens, Ralph, Ed.; Fraillon, Julian, Ed. – International Association for the Evaluation of Educational Achievement, 2018
The IEA's International Civic and Citizenship Education Study (ICCS) investigates the ways in which young people are prepared to undertake their roles as citizens in a range of countries in the second decade of the 21st century. ICCS 2016 is the second cycle of a study initiated in 2009. This technical report follows the publication of several…
Descriptors: Citizenship Education, Comparative Education, Citizen Participation, Citizenship Responsibility
Peer reviewed
Tao, Jian; Shi, Ning-Zhong; Chang, Hua-Hua – Journal of Educational and Behavioral Statistics, 2012
For mixed-type tests composed of both dichotomous and polytomous items, polytomous items often yield more information than dichotomous ones. To reflect the difference between the two types of items, polytomous items are usually pre-assigned larger weights. We propose an item-weighted likelihood method to better assess examinees' ability…
Descriptors: Test Items, Weighted Scores, Maximum Likelihood Statistics, Statistical Bias
Peer reviewed
Palacio, Marcela; Gaviria, Sandra; Brown, James Dean – PROFILE: Issues in Teachers' Professional Development, 2016
Frustrations with traditional testing led a group of teachers at the English for adults program at Universidad EAFIT (Colombia) to design tests aligned with the institutional teaching philosophy and classroom practices. This article reports on a study of an item-by-item evaluation of a series of English exams for validity and reliability in an…
Descriptors: Foreign Countries, English (Second Language), Second Language Learning, Second Language Instruction
Peer reviewed
Keller, Lisa A.; Keller, Robert R. – Applied Measurement in Education, 2015
Equating test forms is an essential activity in standardized testing, one whose importance has grown under the accountability systems mandated by Adequate Yearly Progress. It is through equating that scores from different test forms become comparable, which allows for the tracking of changes in the performance of students from…
Descriptors: Item Response Theory, Rating Scales, Standardized Tests, Scoring Rubrics
Peer reviewed
Qian, Jiahe; Jiang, Yanming; von Davier, Alina A. – ETS Research Report Series, 2013
Several factors could cause variability in item response theory (IRT) linking and equating procedures, such as the variability across examinee samples and/or test items, seasonality, regional differences, native language diversity, gender, and other demographic variables. Hence, the following question arises: Is it possible to select optimal…
Descriptors: Item Response Theory, Test Items, Sampling, True Scores
Peer reviewed
Li, Yanmei – ETS Research Report Series, 2012
In a common-item (anchor) equating design, the common items should be evaluated for item parameter drift. Drifted items are often removed. For a test that contains mostly dichotomous items and only a small number of polytomous items, removing some drifted polytomous anchor items may result in anchor sets that no longer resemble mini-versions of…
Descriptors: Scores, Item Response Theory, Equated Scores, Simulation