ERIC - Search Results

Showing 1 to 15 of 50 results

What Affects the Quality of Score Transformations? Potential Issues in True-Score Equating Using the Partial Credit Model

Peer reviewed

Fellinghauer, Carolina; Debelak, Rudolf; Strobl, Carolin – Educational and Psychological Measurement, 2023

This simulation study investigated to what extent departures from construct similarity as well as differences in the difficulty and targeting of scales impact the score transformation when scales are equated by means of concurrent calibration using the partial credit model with a common person design. Practical implications of the simulation…

Descriptors: True Scores, Equated Scores, Test Items, Sample Size

A Dialectic on Validity: Explanation-Focused and the Many Ways of Being Human

Peer reviewed

Bruno D. Zumbo – International Journal of Assessment Tools in Education, 2023

In line with the journal volume's theme, this essay considers lessons from the past and visions for the future of test validity. In the first part of the essay, a description of historical trends in test validity since the early 1900s leads to the natural question of whether the discipline has progressed in its definition and description of test…

Descriptors: Test Theory, Test Validity, True Scores, Definitions

Alternative Methods for Item Parameter Estimation: From CTT to IRT. Research Report. ETS RR-22-12

Peer reviewed

Guo, Hongwen; Lu, Ru; Johnson, Matthew S.; McCaffrey, Dan F. – ETS Research Report Series, 2022

It is desirable for an educational assessment to be constructed of items that can differentiate different performance levels of test takers, and thus it is important to estimate accurately the item discrimination parameters in either classical test theory or item response theory. It is particularly challenging to do so when the sample sizes are…

Descriptors: Test Items, Item Response Theory, Item Analysis, Educational Assessment

Should Items and Answer Keys of Small-Scale Exams Be Published?

Peer reviewed

Selvi, Hüseyin – Higher Education Studies, 2020

This study aimed to examine the effect of using items from previous exams on students' pass-fail rates and on the psychometric properties of the tests and items. The study included data from 115 tests and 11,500 items used in the midterm and final exams of 3,910 students in the preclinical term at the Faculty of Medicine from 2014 to 2019. Data…

Descriptors: Answer Keys, Tests, Test Items, True Scores

On True Score Evaluation Using Item Response Theory Modeling

Peer reviewed

Raykov, Tenko; Dimitrov, Dimiter M.; Marcoulides, George A.; Harrison, Michael – Educational and Psychological Measurement, 2019

Building on prior research on the relationships between key concepts in item response theory and classical test theory, this note contributes to highlighting their important and useful links. A readily and widely applicable latent variable modeling procedure is discussed that can be used for point and interval estimation of the individual person…

Descriptors: True Scores, Item Response Theory, Test Items, Test Theory

Modeling of Item Response Functions under the D-Scoring Method

Peer reviewed

Dimitrov, Dimiter M. – Educational and Psychological Measurement, 2020

This study presents new models for item response functions (IRFs) in the framework of the D-scoring method (DSM) that is gaining attention in the field of educational and psychological measurement and large-scale assessments. In a previous work on DSM, the IRFs of binary items were estimated using a logistic regression model (LRM). However, the LRM…

Descriptors: Item Response Theory, Scoring, True Scores, Scaling

Asymptotic Standard Errors for Item Response Theory True Score Equating of Polytomous Items

Peer reviewed

Cher Wong, Cheow – Journal of Educational Measurement, 2015

Building on previous works by Lord and Ogasawara for dichotomous items, this article proposes an approach to derive the asymptotic standard errors of item response theory true score equating involving polytomous items, for equivalent and nonequivalent groups of examinees. This analytical approach could be used in place of empirical methods like…

Descriptors: Item Response Theory, Error of Measurement, True Scores, Equated Scores

Effects of Differential Item Functioning on Examinees' Test Performance and Reliability of Test

Peer reviewed

Lee, Yi-Hsuan; Zhang, Jinming – International Journal of Testing, 2017

Simulations were conducted to examine the effect of differential item functioning (DIF) on measurement consequences such as total scores, item response theory (IRT) ability estimates, and test reliability in terms of the ratio of true-score variance to observed-score variance and the standard error of estimation for the IRT ability parameter. The…

Descriptors: Test Bias, Test Reliability, Performance, Scores

Controlling Type I Error Rates in Assessing DIF for Logistic Regression Method Combined with SIBTEST Regression Correction Procedure and DIF-Free-Then-DIF Strategy

Peer reviewed

Shih, Ching-Lin; Liu, Tien-Hsiang; Wang, Wen-Chung – Educational and Psychological Measurement, 2014

The simultaneous item bias test (SIBTEST) method regression procedure and the differential item functioning (DIF)-free-then-DIF strategy are applied to the logistic regression (LR) method simultaneously in this study. These procedures are used to adjust the effects of matching true score on observed score and to better control the Type I error…

Descriptors: Test Bias, Regression (Statistics), Test Items, True Scores

Weighting Test Samples in IRT Linking and Equating: Toward an Improved Sampling Design for Complex Equating. Research Report. ETS RR-13-39

Peer reviewed

Qian, Jiahe; Jiang, Yanming; von Davier, Alina A. – ETS Research Report Series, 2013

Several factors could cause variability in item response theory (IRT) linking and equating procedures, such as the variability across examinee samples and/or test items, seasonality, regional differences, native language diversity, gender, and other demographic variables. Hence, the following question arises: Is it possible to select optimal…

Descriptors: Item Response Theory, Test Items, Sampling, True Scores

Reliability and Attribute-Based Scoring in Cognitive Diagnostic Assessment

Peer reviewed

Gierl, Mark J.; Cui, Ying; Zhou, Jiawen – Journal of Educational Measurement, 2009

The attribute hierarchy method (AHM) is a psychometric procedure for classifying examinees' test item responses into a set of structured attribute patterns associated with different components from a cognitive model of task performance. Results from an AHM analysis yield information on examinees' cognitive strengths and weaknesses. Hence, the AHM…

Descriptors: Test Items, True Scores, Psychometrics, Algebra

The Impact of Equating Method and Format Representation of Common Items on the Adequacy of Mixed-Format Test Equating Using Nonequivalent Groups

Hagge, Sarah Lynn – ProQuest LLC, 2010

Mixed-format tests containing both multiple-choice and constructed-response items are widely used on educational tests. Such tests combine the broad content coverage and efficient scoring of multiple-choice items with the assessment of higher-order thinking skills thought to be provided by constructed-response items. However, the combination of…

Descriptors: Test Format, True Scores, Equated Scores, Psychometrics

An Equipercentile Version of the Levine Linear Observed-Score Equating Function Using the Methods of Kernel Equating. Research Report. ETS RR-07-14

Peer reviewed

von Davier, Alina A.; Fournier-Zajac, Stephanie; Holland, Paul W. – ETS Research Report Series, 2007

In the nonequivalent groups with anchor test (NEAT) design, there are several ways to use the information provided by the anchor in the equating process. One of the NEAT-design equating methods is the linear observed-score Levine method (Kolen & Brennan, 2004). It is based on a classical test theory model of the true scores on the test forms…

Descriptors: Equated Scores, Statistical Analysis, Test Items, Test Theory

Reliability and the Nonequivalent Groups with Anchor Test Design. Research Report. ETS RR-07-16

Peer reviewed

Moses, Tim; Kim, Sooyeon – ETS Research Report Series, 2007

This study evaluated the impact of unequal reliability on test equating methods in the nonequivalent groups with anchor test (NEAT) design. Classical true score-based models were compared in terms of their assumptions about how reliability impacts test scores. These models were related to treatment of population ability differences by different…

Descriptors: Reliability, Equated Scores, Test Items, Statistical Analysis

Number Correct Scoring: Comparison between Classical True Score Theory and Multidimensional Item Response Theory.

Rotou, Ourania; Elmore, Patricia B.; Headrick, Todd C. – 2001

This study investigated the number-correct scoring method based on different theories (classical true-score theory and multidimensional item response theory) when a standardized test requires more than one ability for an examinee to get a correct response. The number-correct scoring procedure that is widely used is the one that is defined in…

Descriptors: Item Response Theory, Scoring, Standardized Tests, Test Items
