ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	7
Since 2006 (last 20 years)	16

Descriptor

Correlation	20
Difficulty Level	20
Simulation	20
Test Items	16
Item Response Theory	9
Comparative Analysis	7
Sample Size	6
Equated Scores	4
Item Analysis	4
Accuracy	3
Computation	3
Computer Assisted Testing	3
Error of Measurement	3
Models	3
Psychometrics	3
Test Bias	3
Ability	2
Guessing (Tests)	2
Knowledge Level	2
Latent Trait Theory	2
Mathematical Formulas	2
Maximum Likelihood Statistics	2
Regression (Statistics)	2
Robustness (Statistics)	2
Scores	2
More ▼

Source

ETS Research Report Series	4
Educational and Psychological…	2
Journal of Educational…	2
ProQuest LLC	2
Educational Measurement:…	1
International Journal of…	1
International Journal of…	1
Journal of Vocational Behavior	1
Quality Assurance in…	1
Research Matters	1

Publication Type

Reports - Research	15
Journal Articles	14
Reports - Evaluative	3
Speeches/Meeting Papers	3
Dissertations/Theses -…	2
Numerical/Quantitative Data	1

Education Level

Higher Education	2
Postsecondary Education	2
Elementary Education	1
Two Year Colleges	1

Audience

Researchers

Location

Laws, Policies, & Programs

Assessments and Surveys

SAT (College Admission Test)

What Works Clearinghouse Rating

Showing 1 to 15 of 20 results Save | Export

An Investigation of the Nature and Consequence of the Relationship between IRT Difficulty and Discrimination

Peer reviewed

Direct link

Sweeney, Sandra M.; Sinharay, Sandip; Johnson, Matthew S.; Steinhauer, Eric W. – Educational Measurement: Issues and Practice, 2022

The focus of this paper is on the empirical relationship between item difficulty and item discrimination. Two studies--an empirical investigation and a simulation study--were conducted to examine the association between item difficulty and item discrimination under classical test theory and item response theory (IRT), and the effects of the…

Descriptors: Correlation, Item Response Theory, Item Analysis, Difficulty Level

Closed Formula of Test Length Required for Adaptive Testing with Medium Probability of Solution

Peer reviewed

Direct link

Kárász, Judit T.; Széll, Krisztián; Takács, Szabolcs – Quality Assurance in Education: An International Perspective, 2023

Purpose: Based on the general formula, which depends on the length and difficulty of the test, the number of respondents and the number of ability levels, this study aims to provide a closed formula for the adaptive tests with medium difficulty (probability of solution is p = 1/2) to determine the accuracy of the parameters for each item and in…

Descriptors: Test Length, Probability, Comparative Analysis, Difficulty Level

Investigation of the Effect of Parameter Estimation and Classification Accuracy in Mixture IRT Models under Different Conditions

Peer reviewed
PDF on ERIC

Download full text

Saatcioglu, Fatima Munevver; Atar, Hakan Yavuz – International Journal of Assessment Tools in Education, 2022

This study aims to examine the effects of mixture item response theory (IRT) models on item parameter estimation and classification accuracy under different conditions. The manipulated variables of the simulation study are set as mixture IRT models (Rasch, 2PL, 3PL); sample size (600, 1000); the number of items (10, 30); the number of latent…

Descriptors: Accuracy, Classification, Item Response Theory, Programming Languages

Comparing Small-Sample Equating with Angoff Judgement for Linking Cut-Scores on Two Tests

Download full text

Bramley, Tom – Research Matters, 2020

The aim of this study was to compare, by simulation, the accuracy of mapping a cut-score from one test to another by expert judgement (using the Angoff method) versus the accuracy with a small-sample equating method (chained linear equating). As expected, the standard-setting method resulted in more accurate equating when we assumed a higher level…

Descriptors: Cutting Scores, Standard Setting (Scoring), Equated Scores, Accuracy

Computerized Adaptive Testing in Early Education: Exploring the Impact of Item Position Effects on Ability Estimation

Peer reviewed

Direct link

Albano, Anthony D.; Cai, Liuhan; Lease, Erin M.; McConnell, Scott R. – Journal of Educational Measurement, 2019

Studies have shown that item difficulty can vary significantly based on the context of an item within a test form. In particular, item position may be associated with practice and fatigue effects that influence item parameter estimation. The purpose of this research was to examine the relevance of item position specifically for assessments used in…

Descriptors: Test Items, Computer Assisted Testing, Item Analysis, Difficulty Level

Unidimensional IRT Item Parameter Estimates across Equivalent Test Forms with Confounding Specifications within Dimensions

Peer reviewed

Direct link

Matlock, Ki Lynn; Turner, Ronna – Educational and Psychological Measurement, 2016

When constructing multiple test forms, the number of items and the total test difficulty are often equivalent. Not all test developers match the number of items and/or average item difficulty within subcontent areas. In this simulation study, six test forms were constructed having an equal number of items and average item difficulty overall.…

Descriptors: Item Response Theory, Computation, Test Items, Difficulty Level

The Development of a Content Assessment of Basic Electronics Knowledge. Research Report. ETS RR-20-28

Peer reviewed
PDF on ERIC

Download full text

Steinberg, Jonathan; Andrews-Todd, Jessica; Forsyth, Carolyn; Chamberlain, John; Horwitz, Paul; Koon, Al; Rupp, Andre; McCulla, Laura – ETS Research Report Series, 2020

This study discusses the development of a basic electronics knowledge (BEK) assessment as a pretest activity for undergraduate students in engineering and related fields. The 28 BEK items represent 12 key concepts, including properties of serial circuits, knowledge of electrical laws (e.g., Kirchhoff 's and Ohm's laws), and properties of digital…

Descriptors: Knowledge Level, Skill Development, Psychometrics, Student Evaluation

Estimating Item Difficulty with Comparative Judgments. Research Report. ETS RR-14-39

Peer reviewed
PDF on ERIC

Download full text

Attali, Yigal; Saldivia, Luis; Jackson, Carol; Schuppan, Fred; Wanamaker, Wilbur – ETS Research Report Series, 2014

Previous investigations of the ability of content experts and test developers to estimate item difficulty have, for themost part, produced disappointing results. These investigations were based on a noncomparative method of independently rating the difficulty of items. In this article, we argue that, by eliciting comparative judgments of…

Descriptors: Test Items, Difficulty Level, Comparative Analysis, College Entrance Examinations

A Comparison of Different Psychometric Approaches to Modeling Testlet Structures: An Example with C-Tests

Peer reviewed

Direct link

Schroeders, Ulrich; Robitzsch, Alexander; Schipolowski, Stefan – Journal of Educational Measurement, 2014

C-tests are a specific variant of cloze tests that are considered time-efficient, valid indicators of general language proficiency. They are commonly analyzed with models of item response theory assuming local item independence. In this article we estimated local interdependencies for 12 C-tests and compared the changes in item difficulties,…

Descriptors: Comparative Analysis, Psychometrics, Cloze Procedure, Language Tests

Equating Multidimensional Tests under a Random Groups Design: A Comparison of Various Equating Procedures

Direct link

Lee, Eunjung – ProQuest LLC, 2013

The purpose of this research was to compare the equating performance of various equating procedures for the multidimensional tests. To examine the various equating procedures, simulated data sets were used that were generated based on a multidimensional item response theory (MIRT) framework. Various equating procedures were examined, including…

Descriptors: Equated Scores, Tests, Comparative Analysis, Item Response Theory

The Performance of the Linear Logistic Test Model When the Q-Matrix Is Misspecified: A Simulation Study

Direct link

MacDonald, George T. – ProQuest LLC, 2014

A simulation study was conducted to explore the performance of the linear logistic test model (LLTM) when the relationships between items and cognitive components were misspecified. Factors manipulated included percent of misspecification (0%, 1%, 5%, 10%, and 15%), form of misspecification (under-specification, balanced misspecification, and…

Descriptors: Simulation, Item Response Theory, Models, Test Items

Fitting the Rasch Model to Account for Variation in Item Discrimination

Peer reviewed

Direct link

Weitzman, R. A. – Educational and Psychological Measurement, 2009

Building on the Kelley and Gulliksen versions of classical test theory, this article shows that a logistic model having only a single item parameter can account for varying item discrimination, as well as difficulty, by using item-test correlations to adjust incorrect-correct (0-1) item responses prior to an initial model fit. The fit occurs…

Descriptors: Item Response Theory, Test Items, Difficulty Level, Test Bias

Core Self-Evaluations as Causes of Satisfaction: The Mediating Role of Seeking Task Complexity

Peer reviewed

Direct link

Srivastava, Abhishek; Locke, Edwin A.; Judge, Timothy A.; Adams, John W. – Journal of Vocational Behavior, 2010

This study examined the mediating role of task complexity in the relationship between core self-evaluations (CSE) and satisfaction. In Study 1, eighty three undergraduate business students worked on a strategic decision-making simulation. The simulated environment enabled us to verify the temporal sequence of variables, use an objective measure of…

Descriptors: Job Satisfaction, Difficulty Level, Simulated Environment, Self Evaluation (Individuals)

Validation of a Computerized Cognitive Assessment System for Persons with Stroke: A Pilot Study

Peer reviewed

Direct link

Yip, Chi Kwong; Man, David W. K. – International Journal of Rehabilitation Research, 2009

This study investigates the validity of a newly developed computerized cognitive assessment system (CCAS) that is equipped with rich multimedia to generate simulated testing situations and considers both test item difficulty and the test taker's ability. It is also hypothesized that better predictive validity of the CCAS in self-care of persons…

Descriptors: Test Items, Content Validity, Predictive Validity, Patients

Robinson's Measure of Agreement as a Parallel Forms Reliability Coefficient.

Download full text

Willson, Victor L. – 1977

A major deficiency in classical test theory is the reliance on Pearson product-moment (PPM) correlation concepts in the definition of reliability. PPM measures are totally insensitive to first moment differences in tests which leads to the dubious assumption of essential tan-equivalence. Robinson proposed a measure of agreement that is sensitive…

Descriptors: Comparative Analysis, Correlation, Difficulty Level, Mathematical Formulas

Previous Page | Next Page »

Pages: 1 | 2

Sinharay, Sandip	3
Holland, Paul	2
Adams, John W.	1
Albano, Anthony D.	1
Andrews-Todd, Jessica	1
Atar, Hakan Yavuz	1
Attali, Yigal	1
Bramley, Tom	1
Cai, Liuhan	1
Chamberlain, John	1
Forsyth, Carolyn	1
Horwitz, Paul	1
Hsu, Tse-Chi	1
Jackson, Carol	1
Johnson, Matthew S.	1
Jones, Patricia B.	1
Judge, Timothy A.	1
Kirisci, Levent	1
Koch, William R.	1
Koon, Al	1
Kárász, Judit T.	1
Lease, Erin M.	1
Lee, Eunjung	1
Locke, Edwin A.	1
More ▼