The Effect of Item Pools of Different Strengths on the Test Results of Computerized-Adaptive Testing
Kezer, Fatih – International Journal of Assessment Tools in Education, 2021
Item response theory (IRT) offers important advantages for exams that are, or will be, administered digitally. For computerized adaptive tests to make valid and reliable estimates under IRT, high-quality item pools are needed. This study examines how adaptive test applications vary across item pools which consist of items…
Descriptors: Item Banks, Adaptive Testing, Computer Assisted Testing, Item Response Theory
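As a minimal sketch of the item-selection step such CAT studies depend on (all item parameters below are invented for illustration), an adaptive test can pick, at each step, the unadministered item with maximum Fisher information at the current ability estimate, here under a 2PL model:

```python
import math

def p_2pl(theta, a, b):
    """Probability of a correct response under the 2PL model."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def item_information(theta, a, b):
    """Fisher information of a 2PL item at ability theta: a^2 * p * (1 - p)."""
    p = p_2pl(theta, a, b)
    return a * a * p * (1.0 - p)

def select_next_item(theta, pool, administered):
    """Pick the unadministered item with maximum information at theta."""
    candidates = [i for i in range(len(pool)) if i not in administered]
    return max(candidates, key=lambda i: item_information(theta, *pool[i]))

# Hypothetical pool of (a, b) item parameters.
pool = [(1.2, -1.0), (0.8, 0.0), (1.5, 0.2), (1.0, 1.5)]
next_item = select_next_item(0.0, pool, administered={0})
```

A weak pool in the sense studied above is one whose difficulties cluster away from the abilities being measured, so the maximum available information at many ability levels stays low.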
Akin-Arikan, Çiğdem; Gelbal, Selahattin – Eurasian Journal of Educational Research, 2021
Purpose: This study aims to compare the performances of Item Response Theory (IRT) equating and kernel equating (KE) methods based on equating errors (RMSD) and standard error of equating (SEE) using the anchor item nonequivalent groups design. Method: Within this scope, a set of conditions, including ability distribution, type of anchor items…
Descriptors: Equated Scores, Item Response Theory, Test Items, Statistical Analysis
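The RMSD criterion used to compare equating methods above can be computed directly from two sets of equated scores evaluated at the same raw-score points; a minimal sketch, with hypothetical score values:

```python
import math

def rmsd(equated, criterion):
    """Root mean squared difference between an equating function's scores
    and a criterion equating, evaluated at the same raw-score points."""
    assert len(equated) == len(criterion)
    return math.sqrt(sum((e - c) ** 2 for e, c in zip(equated, criterion)) / len(equated))

# Hypothetical equated scores at raw-score points 0..4 versus a criterion.
irt_equated = [0.1, 1.2, 2.1, 3.0, 4.2]
criterion = [0.0, 1.0, 2.0, 3.0, 4.0]
error = rmsd(irt_equated, criterion)
```

The standard error of equating (SEE) is a separate, sampling-based quantity; RMSD as sketched here only summarizes the discrepancy between two equating functions.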
Bjermo, Jonas; Miller, Frank – Applied Measurement in Education, 2021
In recent years, interest in measuring growth in student ability in various subjects between different grades in school has increased. Good precision in the estimated growth is therefore important. This paper aims to compare estimation methods and test designs with respect to the precision and bias of the estimated growth of mean ability…
Descriptors: Scaling, Ability, Computation, Test Items
DeMars, Christine E.; Jurich, Daniel P. – Educational and Psychological Measurement, 2015
In educational testing, differential item functioning (DIF) statistics must be accurately estimated to ensure the appropriate items are flagged for inspection or removal. This study showed how using the Rasch model to estimate DIF may introduce considerable bias in the results when there are large group differences in ability (impact) and the data…
Descriptors: Test Bias, Guessing (Tests), Ability, Differences
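The mechanism behind this bias can be illustrated from the model formulas: when responses follow a 3PL model with guessing but are scored with the Rasch model, low-ability examinees answer hard items correctly far more often than the Rasch model predicts, and with large ability differences between groups that misfit can masquerade as DIF. A small sketch with assumed parameter values:

```python
import math

def logistic(x):
    return 1.0 / (1.0 + math.exp(-x))

def p_rasch(theta, b):
    """Rasch model: common discrimination, no guessing floor."""
    return logistic(theta - b)

def p_3pl(theta, a, b, c):
    """3PL model: discrimination a and guessing lower asymptote c."""
    return c + (1.0 - c) * logistic(a * (theta - b))

# For a low-ability examinee on a hard item, guessing keeps the 3PL
# probability near c, while the Rasch model predicts nearly zero.
low_theta, item_b, guess_c = -2.0, 1.5, 0.2  # assumed values
gap = p_3pl(low_theta, 1.0, item_b, guess_c) - p_rasch(low_theta, item_b)
```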
Çokluk, Ömay; Gül, Emrah; Doğan-Gül, Çilem – Educational Sciences: Theory and Practice, 2016
The study aims to examine whether differential item functioning appears across three test forms whose item orders are random or sequential (easy-to-hard and hard-to-easy), based on Classical Test Theory (CTT) and Item Response Theory (IRT) methods and taking item difficulty levels into account. In the correlational research, the…
Descriptors: Test Bias, Test Items, Difficulty Level, Test Theory
Goldhammer, Frank – Measurement: Interdisciplinary Research and Perspectives, 2015
The main challenge of ability tests relates to the difficulty of items, whereas speed tests demand that test takers complete very easy items quickly. This article proposes a conceptual framework to represent how performance depends on both between-person differences in speed and ability and the speed-ability compromise within persons. Related…
Descriptors: Ability, Aptitude Tests, Reaction Time, Test Items
Schoen, Robert C.; Yang, Xiaotong; Liu, Sicong; Paek, Insu – Grantee Submission, 2017
The Early Fractions Test v2.2 is a paper-pencil test designed to measure mathematics achievement of third- and fourth-grade students in the domain of fractions. The purpose, or intended use, of the Early Fractions Test v2.2 is to serve as a measure of student outcomes in a randomized trial designed to estimate the effect of an educational…
Descriptors: Psychometrics, Mathematics Tests, Mathematics Achievement, Fractions
Shulruf, Boaz; Jones, Phil; Turner, Rolf – Higher Education Studies, 2015
The determination of pass/fail decisions over borderline grades (i.e., grades which do not clearly distinguish between competent and incompetent examinees) has been an ongoing challenge for academic institutions. This study utilises the Objective Borderline Method (OBM) to determine examinee ability and item difficulty, and from that…
Descriptors: Undergraduate Students, Pass Fail Grading, Decision Making, Probability
Matlock, Ki Lynn – ProQuest LLC, 2013
When test forms that have equal total test difficulty and number of items vary in difficulty and length within sub-content areas, an examinee's estimated score may vary across equivalent forms, depending on how well his or her true ability in each sub-content area aligns with the difficulty of items and number of items within these areas.…
Descriptors: Test Items, Difficulty Level, Ability, Test Content
Store, Davie – ProQuest LLC, 2013
The impact of particular types of context effects on actual scores is less well understood, although there has been some research on certain types of context effects under the nonequivalent anchor test (NEAT) design. In addition, the impact of item context effects on scores has not been investigated extensively when item…
Descriptors: Test Items, Equated Scores, Accuracy, Item Response Theory
Grasshoff, Ulrike; Holling, Heinz; Schwabe, Rainer – Psychometrika, 2012
In this paper, optimal designs will be derived for estimating the ability parameters of the Rasch model when difficulty parameters are known. It is well established that a design is locally D-optimal if the ability and difficulty coincide. But locally optimal designs require that the ability parameters to be estimated are known. To attenuate this…
Descriptors: Item Response Theory, Test Items, Psychometrics, Statistical Analysis
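The result that a design is locally D-optimal when ability and difficulty coincide follows from the Rasch item information function, which equals p(1 - p) and peaks at p = 0.5, i.e., at b = theta. A minimal numerical check (the ability value is arbitrary):

```python
import math

def rasch_info(theta, b):
    """Fisher information of a Rasch item at ability theta: p * (1 - p)."""
    p = 1.0 / (1.0 + math.exp(-(theta - b)))
    return p * (1.0 - p)

# Information is maximal when item difficulty matches ability (b == theta),
# which is why the locally optimal design places items at the very ability
# the design is supposed to help estimate.
theta = 0.7
peak = rasch_info(theta, theta)        # 0.25, the maximum
off = rasch_info(theta, theta + 1.0)   # strictly smaller
```

The circularity noted in the abstract is visible here: choosing b to maximize information requires knowing theta, which is the quantity being estimated.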
MacDonald, George T. – ProQuest LLC, 2014
A simulation study was conducted to explore the performance of the linear logistic test model (LLTM) when the relationships between items and cognitive components were misspecified. Factors manipulated included percent of misspecification (0%, 1%, 5%, 10%, and 15%), form of misspecification (under-specification, balanced misspecification, and…
Descriptors: Simulation, Item Response Theory, Models, Test Items
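The LLTM assumption that the misspecification factors above perturb is that each item's difficulty is a linear combination of cognitive-component parameters through a Q matrix: b_i = sum_k q_ik * eta_k. A toy sketch with an invented Q matrix and component weights:

```python
# Hypothetical item-by-component design matrix Q: row i marks which
# cognitive components item i requires. Misspecifying these entries is
# what the simulation factors above manipulate.
Q = [
    [1, 0, 1],  # item 1 uses components 1 and 3
    [1, 1, 0],  # item 2 uses components 1 and 2
    [0, 1, 1],  # item 3 uses components 2 and 3
]
eta = [0.5, -0.2, 1.0]  # assumed component difficulty weights

def lltm_difficulty(q_row, eta):
    """Item difficulty as the Q-weighted sum of component parameters."""
    return sum(q * e for q, e in zip(q_row, eta))

difficulties = [lltm_difficulty(row, eta) for row in Q]
```

Under-specification (dropping a 1 from Q) or over-specification (adding a spurious 1) changes the reconstructed difficulties, which is the mechanism the misspecification study examines.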
He, Wei; Reckase, Mark D. – Educational and Psychological Measurement, 2014
For computerized adaptive tests (CATs) to work well, they must have an item pool with sufficient numbers of good quality items. Many researchers have pointed out that, in developing item pools for CATs, not only is the item pool size important but also the distribution of item parameters and practical considerations such as content distribution…
Descriptors: Item Banks, Test Length, Computer Assisted Testing, Adaptive Testing
Rao, Vasanthi – ProQuest LLC, 2012
In 1997, based on amendments to the Individuals with Disabilities Education Act (IDEA), all states faced a statutory requirement to develop and implement alternate assessments for students with disabilities who were unable to participate in the statewide large-scale assessment. States were given the challenge of creating, implementing, and…
Descriptors: Alternative Assessment, Psychometrics, Item Response Theory, Models
Scherman, Vanessa; Howie, Sarah J.; Bosker, Roel J. – Educational Research and Evaluation, 2011
In information-rich environments, schools are often presented with a myriad of data from which decisions need to be made. The use of the information on a classroom level may be facilitated if performance could be described in terms of levels of proficiency or benchmarks. The aim of this article is to explore benchmarks using data from a monitoring…
Descriptors: Standard Setting, Foreign Countries, Grade 8, Ability