ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	4
Since 2006 (last 20 years)	5

Descriptor

Goodness of Fit	6
Simulation	6
Test Validity	6
Scores	5
Test Items	4
Item Analysis	3
Item Response Theory	3
Psychometrics	3
Factor Analysis	2
Test Length	2
Test Reliability	2
Achievement Tests	1
Anxiety	1
Cheating	1
Comparative Analysis	1
Computation	1
Factor Structure	1
Guessing (Tests)	1
Heuristics	1
Inferences	1
Information Systems	1
Language Tests	1
Least Squares Statistics	1
Mathematical Models	1
Mathematics	1
More ▼

Source

Educational and Psychological…	1
International Educational…	1
Journal of Educational…	1
Journal of Educational and…	1
ProQuest LLC	1

Author

DeMars, Christine E.	1
Eckerly, Carol	1
Edwards, Michael C.	1
Gorney, Kylie	1
Leite, Walter L.	1
Marcoulides, Katerina M.	1
Raborn, Anthony W.	1
Reckase, Mark D.	1
Sinharay, Sandip	1
Stanley, Leanne M.	1
Steinkamp, Susan Christa	1
Wise, Steven L.	1
Wollack, James A.	1
More ▼

Publication Type

Reports - Research	4
Journal Articles	3
Dissertations/Theses -…	1
Reports - Evaluative	1
Speeches/Meeting Papers	1

Education Level

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 6 results Save | Export

Using Item Scores and Distractors to Detect Item Compromise and Preknowledge

Peer reviewed

Direct link

Gorney, Kylie; Wollack, James A.; Sinharay, Sandip; Eckerly, Carol – Journal of Educational and Behavioral Statistics, 2023

Any time examinees have had access to items and/or answers prior to taking a test, the fairness of the test and validity of test score interpretations are threatened. Therefore, there is a high demand for procedures to detect both compromised items (CI) and examinees with preknowledge (EWP). In this article, we develop a procedure that uses item…

Descriptors: Scores, Test Validity, Test Items, Prior Learning

A Comparison of Automated Scale Short Form Selection Strategies

Peer reviewed
PDF on ERIC

Download full text

Raborn, Anthony W.; Leite, Walter L.; Marcoulides, Katerina M. – International Educational Data Mining Society, 2019

Short forms of psychometric scales have been commonly used in educational and psychological research to reduce the burden of test administration. However, it is challenging to select items for a short form that preserve the validity and reliability of the scores of the original scale. This paper presents and evaluates multiple automated methods…

Descriptors: Psychometrics, Measures (Individuals), Mathematics, Heuristics

Identifying Aberrant Responding: Use of Multiple Measures

Direct link

Steinkamp, Susan Christa – ProQuest LLC, 2017

For test scores that rely on the accurate estimation of ability via an IRT model, their use and interpretation is dependent upon the assumption that the IRT model fits the data. Examinees who do not put forth full effort in answering test questions, have prior knowledge of test content, or do not approach a test with the intent of answering…

Descriptors: Test Items, Item Response Theory, Scores, Test Wiseness

Reliability and Model Fit

Peer reviewed

Direct link

Stanley, Leanne M.; Edwards, Michael C. – Educational and Psychological Measurement, 2016

The purpose of this article is to highlight the distinction between the reliability of test scores and the fit of psychometric measurement models, reminding readers why it is important to consider both when evaluating whether test scores are valid for a proposed interpretation and/or use. It is often the case that an investigator judges both the…

Descriptors: Test Reliability, Goodness of Fit, Scores, Patients

An Application of Item Response Time: The Effort-Moderated IRT Model

Peer reviewed

Direct link

Wise, Steven L.; DeMars, Christine E. – Journal of Educational Measurement, 2006

The validity of inferences based on achievement test scores is dependent on the amount of effort that examinees put forth while taking the test. With low-stakes tests, for which this problem is particularly prevalent, there is a consequent need for psychometric models that can take into account differing levels of examinee effort. This article…

Descriptors: Guessing (Tests), Psychometrics, Inferences, Reaction Time

A Comparison of the One- and Three-Parameter Logistic Models for Item Calibration.

Download full text

Reckase, Mark D. – 1978

Five comparisons were made relative to the quality of estimates of ability parameters and item calibrations obtained from the one-parameter and three-parameter logistic models. The results indicate: (1) The three-parameter model fit the test data better in all cases than did the one-parameter model. For simulation data sets, multi-factor data were…

Descriptors: Comparative Analysis, Goodness of Fit, Item Analysis, Mathematical Models