Showing 1 to 15 of 17 results
Peer reviewed
Clauser, Brian E.; Kane, Michael; Clauser, Jerome C. – Journal of Educational Measurement, 2020
An Angoff standard setting study generally yields judgments on a number of items by a number of judges (who may or may not be nested in panels). Variability associated with judges (and possibly panels) contributes error to the resulting cut score. The variability associated with items plays a more complicated role. To the extent that the mean item…
Descriptors: Cutting Scores, Generalization, Decision Making, Standard Setting
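As a hedged illustration of the setup this abstract describes, the sketch below (Python, with entirely hypothetical ratings) takes a judges-by-items matrix of Angoff judgments, forms the cut score as the grand mean, and estimates the error contributed by judges from the spread of the judge means.

import numpy as np

# Hypothetical Angoff ratings: rows = judges, columns = items; each entry is a
# judge's estimated probability that a minimally competent examinee answers correctly.
rng = np.random.default_rng(1)
ratings = np.clip(rng.normal(0.6, 0.1, size=(8, 40)), 0, 1)

judge_means = ratings.mean(axis=1)          # each judge's implied cut (proportion correct)
cut_score = judge_means.mean()              # grand mean over judges and items
se_judges = judge_means.std(ddof=1) / np.sqrt(len(judge_means))  # error attributable to judges

print(f"cut = {cut_score:.3f}, SE(judges) = {se_judges:.3f}")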
Peer reviewed
Reichle, Erik D.; Drieghe, Denis – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2015
There is an ongoing debate about whether fixation durations during reading are only influenced by the processing difficulty of the words being fixated (i.e., the serial-attention hypothesis) or whether they are also influenced by the processing difficulty of the previous and/or upcoming words (i.e., the attention-gradient hypothesis). This article…
Descriptors: Reading, Eye Movements, Error of Measurement, Difficulty Level
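The contrast between the two hypotheses is often examined with regressions of fixation duration on properties of the fixated word and its neighbors; the sketch below uses invented data and variable names purely to illustrate that style of analysis, not the authors' models.

import numpy as np

rng = np.random.default_rng(2)
n = 1000
freq_n   = rng.normal(0, 1, n)   # log frequency of the fixated word (word n)
freq_nm1 = rng.normal(0, 1, n)   # log frequency of the previous word (word n-1)
freq_np1 = rng.normal(0, 1, n)   # log frequency of the upcoming word (word n+1)
duration = 250 - 20 * freq_n - 5 * freq_np1 + rng.normal(0, 30, n)   # simulated fixation durations (ms)

X = np.column_stack([np.ones(n), freq_n, freq_nm1, freq_np1])
coef, *_ = np.linalg.lstsq(X, duration, rcond=None)
print(dict(zip(["intercept", "word n", "word n-1", "word n+1"], coef.round(1))))
# Reliable weights on the n-1 / n+1 terms are the pattern an attention-gradient account predicts.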
Peer reviewed
Arce, Alvaro J.; Wang, Ze – International Journal of Testing, 2012
The traditional approach to scale modified-Angoff cut scores transfers the raw cuts to an existing raw-to-scale score conversion table. Under the traditional approach, cut scores and conversion table raw scores are not only seen as interchangeable but also as originating from a common scaling process. In this article, we propose an alternative…
Descriptors: Generalizability Theory, Item Response Theory, Cutting Scores, Scaling
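Under the traditional approach the abstract contrasts with, scaling the cut amounts to carrying the raw cut into an existing raw-to-scale conversion table; a minimal sketch with a made-up table:

import numpy as np

raw_scores   = np.arange(0, 51)          # raw score points on a hypothetical 50-item form
scale_scores = 200 + 4 * raw_scores      # stand-in raw-to-scale conversion table

raw_cut = 31.4                           # modified-Angoff raw cut (judge mean times number of items)
scale_cut = np.interp(raw_cut, raw_scores, scale_scores)   # interpolate between table rows
print(scale_cut)                         # 325.6 under this invented table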
Irvin, P. Shawn; Alonzo, Julie; Lai, Cheng-Fei; Park, Bitnara Jasmine; Tindal, Gerald – Behavioral Research and Teaching, 2012
In this technical report, we present the results of a reliability study, conducted in the spring of 2011, of the seventh-grade multiple-choice reading comprehension measures available on the easyCBM learning system. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
Descriptors: Reading Comprehension, Testing Programs, Statistical Analysis, Grade 7
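One of the indices listed, split-half reliability with the Spearman-Brown correction, can be sketched as follows (the responses below are simulated, not easyCBM data):

import numpy as np

rng = np.random.default_rng(3)
n_students, n_items = 200, 20
ability = rng.normal(0, 1, n_students)
difficulty = rng.normal(0, 1, n_items)
p = 1 / (1 + np.exp(-(ability[:, None] - difficulty[None, :])))
responses = (rng.random((n_students, n_items)) < p).astype(int)

odd  = responses[:, 0::2].sum(axis=1)      # odd-item half scores
even = responses[:, 1::2].sum(axis=1)      # even-item half scores
r_half = np.corrcoef(odd, even)[0, 1]
split_half = 2 * r_half / (1 + r_half)     # Spearman-Brown step-up to full test length
print(round(split_half, 3))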
Peer reviewed
Bristow, M.; Erkorkmaz, K.; Huissoon, J. P.; Jeon, Soo; Owen, W. S.; Waslander, S. L.; Stubley, G. D. – IEEE Transactions on Education, 2012
Any meaningful initiative to improve the teaching and learning in introductory control systems courses needs a clear test of student conceptual understanding to determine the effectiveness of proposed methods and activities. The authors propose a control systems concept inventory. Development of the inventory was collaborative and iterative. The…
Descriptors: Diagnostic Tests, Concept Formation, Undergraduate Students, Engineering Education
Park, Bitnara Jasmine; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2011
This technical report describes the process of development and piloting of reading comprehension measures that are appropriate for seventh-grade students as part of an online progress screening and monitoring assessment system, http://easycbm.com. Each measure consists of an original fictional story of approximately 1,600 to 1,900 words with 20…
Descriptors: Reading Comprehension, Reading Tests, Grade 7, Test Construction
Peer reviewed
Culpepper, Steven Andrew – Multivariate Behavioral Research, 2009
This study linked nonlinear profile analysis (NPA) of dichotomous responses with an existing family of item response theory models and generalized latent variable models (GLVM). The NPA method offers several benefits over previous internal profile analysis methods: (a) NPA is estimated with maximum likelihood in a GLVM framework rather than…
Descriptors: Profiles, Item Response Theory, Models, Maximum Likelihood Statistics
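For readers unfamiliar with the model family involved, here is a minimal sketch of the two-parameter logistic IRT model, one member of the generalized latent variable family the abstract refers to (the item parameters below are illustrative):

import numpy as np

def p_2pl(theta, a, b):
    """Probability of a correct response under the 2PL model."""
    return 1 / (1 + np.exp(-a * (theta - b)))

def log_likelihood(theta, responses, a, b):
    """Log-likelihood of one examinee's dichotomous response vector."""
    p = p_2pl(theta, a, b)
    return np.sum(responses * np.log(p) + (1 - responses) * np.log(1 - p))

a = np.array([1.2, 0.8, 1.5])       # discriminations
b = np.array([-0.5, 0.0, 0.7])      # difficulties
x = np.array([1, 1, 0])             # observed item responses
print(log_likelihood(0.2, x, a, b)) # maximizing this over theta gives the ML ability estimate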
Alonzo, Julie; Liu, Kimy; Tindal, Gerald – Behavioral Research and Teaching, 2008
This technical report describes the development of reading comprehension assessments designed for use as progress monitoring measures for second-grade students. The creation, piloting, and technical adequacy of the measures are presented. The following are appended: (1) Item Specifications for MC [Multiple Choice] Comprehension - Passage…
Descriptors: Reading Comprehension, Reading Tests, Grade 2, Elementary School Students
Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2008
This technical report describes the development and piloting of reading comprehension measures designed for use by fifth-grade students as part of an online progress monitoring assessment system, http://easycbm.com. Each comprehension measure comprises an original work of narrative fiction approximately 1,500 words in length, followed by 20…
Descriptors: Reading Comprehension, Reading Tests, Grade 5, Multiple Choice Tests
Peer reviewed
Green, Donald Ross; And Others – Applied Measurement in Education, 1989
Potential benefits of using item response theory in test construction are evaluated using the experience and evidence accumulated during nine years of using a three-parameter model in the development of major achievement batteries. Topics addressed include error of measurement, test equating, item bias, and item difficulty. (TJH)
Descriptors: Achievement Tests, Computer Assisted Testing, Difficulty Level, Equated Scores
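The three-parameter model in question is the 3PL item characteristic curve; a quick sketch with made-up item parameters:

import numpy as np

def p_3pl(theta, a, b, c):
    """3PL probability of success: a guessing floor c plus a logistic curve in a * (theta - b)."""
    return c + (1 - c) / (1 + np.exp(-a * (theta - b)))

theta = np.linspace(-3, 3, 7)
print(np.round(p_3pl(theta, a=1.0, b=0.0, c=0.2), 3))
# Probabilities rise from roughly the guessing level (0.2) toward 1 as ability increases.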
Tang, Huixing – 1994
A method is presented for the simultaneous analysis of differential item functioning (DIF) in multi-factor situations. The method is unique in that it combines item response theory (IRT) and analysis of variance (ANOVA), takes a simultaneous approach to multifactor DIF analysis, and is capable of capturing interaction and controlling for possible…
Descriptors: Ability, Analysis of Variance, Difficulty Level, Error of Measurement
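One hedged reading of the combination described is to estimate an item difficulty within each cell of the grouping factors and then submit those estimates to a factorial ANOVA, with the interaction term flagging DIF tied to factor combinations. The sketch below simulates such per-group estimates; the factors, shift, and noise levels are invented, and this is not Tang's exact procedure.

import numpy as np
import pandas as pd
import statsmodels.api as sm
import statsmodels.formula.api as smf

# Simulated per-group item difficulty estimates for a 20-item test, crossed by two
# hypothetical grouping factors; only the M x L2 cell carries a DIF shift of 0.4 logits.
rng = np.random.default_rng(4)
b_item = rng.normal(0, 1, 20)
rows = []
for gender in ["F", "M"]:
    for language in ["L1", "L2"]:
        shift = 0.4 if (gender, language) == ("M", "L2") else 0.0
        b_hat = b_item + shift + rng.normal(0, 0.1, 20)   # estimation noise
        rows += [dict(item=i, gender=gender, language=language, difficulty=b)
                 for i, b in enumerate(b_hat)]
df = pd.DataFrame(rows)

# Item is a blocking factor; the gender x language interaction captures DIF that depends
# on the combination of factors rather than on either factor alone.
model = smf.ols("difficulty ~ C(item) + C(gender) * C(language)", data=df).fit()
print(sm.stats.anova_lm(model, typ=2))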
Fox, Jean-Paul; Glas, Cees A. W. – 1998
A two-level regression model is imposed on the ability parameters in an item response theory (IRT) model. The advantage of using latent rather than observed scores as dependent variables of a multilevel model is that this offers the possibility of separating the influence of item difficulty and ability level and modeling response variation and…
Descriptors: Ability, Bayesian Statistics, Difficulty Level, Error of Measurement
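A sketch of the data-generating structure described: ability follows a level-2 regression (here a student covariate plus a school random intercept), and item responses are then generated from a Rasch-type IRT model rather than summarized as observed scores. All quantities below are simulated; fitting such a model would typically use Bayesian estimation, which is beyond this sketch.

import numpy as np

rng = np.random.default_rng(5)
n_schools, n_per_school, n_items = 30, 25, 15

school_effect = rng.normal(0, 0.5, n_schools)               # level-2 random intercepts
x = rng.normal(0, 1, (n_schools, n_per_school))             # a student-level covariate
theta = 0.4 * x + school_effect[:, None] + rng.normal(0, 0.8, (n_schools, n_per_school))

b = rng.normal(0, 1, n_items)                               # item difficulties
eta = theta.reshape(-1, 1) - b                              # Rasch logits for every student-item pair
responses = (rng.random(eta.shape) < 1 / (1 + np.exp(-eta))).astype(int)
print(responses.shape)   # (750, 15): the dichotomous data a multilevel IRT model would be fit to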
Li, Yuan H.; Griffith, William D.; Tam, Hak P. – 1997
This study explores the relative merits of a potentially useful item response theory (IRT) linking design: using a single set of anchor items with fixed common item parameters (FCIP) during the calibration process. An empirical study was conducted to investigate the appropriateness of this linking design using 6 groups of students taking 6 forms…
Descriptors: Ability, Difficulty Level, Equated Scores, Error of Measurement
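The FCIP idea can be sketched as follows: anchor item parameters from the old form are held fixed while the remaining parameters are estimated, which places the new items on the old scale. The code below uses a simple Rasch model and joint maximum likelihood purely as a stand-in (operational FCIP calibrations typically use marginal ML in dedicated IRT software); all data are simulated.

import numpy as np
from scipy.optimize import minimize

# Simulated responses to a 20-item form whose first 8 items are anchors with
# difficulties treated as known from the old form's calibration.
rng = np.random.default_rng(7)
n_persons, n_items, n_anchor = 400, 20, 8
theta_true = rng.normal(0.2, 1.0, n_persons)
b_true = rng.normal(0.0, 1.0, n_items)
p_true = 1 / (1 + np.exp(-(theta_true[:, None] - b_true[None, :])))
X = (rng.random((n_persons, n_items)) < p_true).astype(float)

b_anchor = b_true[:n_anchor]                              # fixed common item parameters

def nll_and_grad(params):
    theta = params[:n_persons]
    b = np.concatenate([b_anchor, params[n_persons:]])    # anchors never move
    eta = theta[:, None] - b[None, :]
    p = 1 / (1 + np.exp(-eta))
    nll = -np.sum(X * eta - np.logaddexp(0.0, eta))
    g_theta = -(X - p).sum(axis=1)                        # d(-LL)/d(theta_i)
    g_b_new = (X - p).sum(axis=0)[n_anchor:]              # d(-LL)/d(b_j), new items only
    return nll, np.concatenate([g_theta, g_b_new])

start = np.zeros(n_persons + n_items - n_anchor)
res = minimize(nll_and_grad, start, jac=True, method="L-BFGS-B")
print(np.round(res.x[n_persons:] - b_true[n_anchor:], 2))  # new items recovered on the anchor scale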
Green, Donald Ross; And Others – 1988
Potential benefits of using item response theory in test construction are evaluated, based on the experience and evidence accumulated during 9 years of using a three-parameter model in the construction of major achievement batteries. Specific benefits covered include obtaining sample-free item calibrations and item-free person measurement,…
Descriptors: Achievement Tests, Computer Assisted Testing, Difficulty Level, Elementary Secondary Education
Peer reviewed
De Ayala, R. J. – Educational and Psychological Measurement, 1992
Effects of dimensionality on ability estimation of an adaptive test were examined using generated data in Bayesian computerized adaptive testing (CAT) simulations. Generally, increasing interdimensional difficulty association produced a slight decrease in test length and an increase in accuracy of ability estimation as assessed by root mean square…
Descriptors: Adaptive Testing, Bayesian Statistics, Computer Assisted Testing, Computer Simulation
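The kind of simulation summarized there can be sketched as a fixed-length adaptive test over a Rasch pool with an EAP (Bayesian) ability update after each response, evaluated by the root mean square error of the final estimates; the pool, test length, and grid below are invented.

import numpy as np

rng = np.random.default_rng(6)
pool_b = rng.uniform(-2.5, 2.5, 200)            # Rasch difficulties of the item pool
grid = np.linspace(-4, 4, 81)                   # quadrature grid for the ability posterior
prior = np.exp(-0.5 * grid**2); prior /= prior.sum()

def simulate_cat(theta_true, test_length=20):
    post = prior.copy()
    available = np.ones(len(pool_b), dtype=bool)
    for _ in range(test_length):
        est = np.sum(grid * post)                               # current EAP ability estimate
        p_est = 1 / (1 + np.exp(-(est - pool_b)))
        info = np.where(available, p_est * (1 - p_est), -1.0)   # Rasch information; used items blocked
        j = int(np.argmax(info))
        available[j] = False
        u = rng.random() < 1 / (1 + np.exp(-(theta_true - pool_b[j])))   # simulated response
        like = 1 / (1 + np.exp(-(grid - pool_b[j])))
        post = post * (like if u else 1 - like)
        post /= post.sum()
    return np.sum(grid * post)                                  # final EAP estimate

thetas = rng.normal(0, 1, 300)
estimates = np.array([simulate_cat(t) for t in thetas])
print(round(float(np.sqrt(np.mean((estimates - thetas) ** 2))), 3))   # RMSE of ability recovery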