Showing 1 to 15 of 24 results
Peer reviewed
PDF on ERIC
Guo, Hongwen; Dorans, Neil J. – ETS Research Report Series, 2019
The Mantel-Haenszel delta difference (MH D-DIF) and the standardized proportion difference (STD P-DIF) are two observed-score methods that have been used to assess differential item functioning (DIF) at Educational Testing Service since the early 1990s. Latent variable approaches to assessing measurement invariance at the item level have been…
Descriptors: Test Bias, Educational Testing, Statistical Analysis, Item Response Theory
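The two observed-score indices named above are simple enough to compute directly. Below is a minimal Python sketch of MH D-DIF (the ETS delta metric, -2.35 times the log of the Mantel-Haenszel common odds ratio) and STD P-DIF (a focal-group-weighted mean difference in proportion correct on the studied item); the stratified counts are invented for illustration:

```python
import math

# Toy data for one studied item, examinees stratified by total-score level.
# Each tuple: (ref_correct, ref_incorrect, focal_correct, focal_incorrect).
strata = [
    (40, 60, 30, 70),
    (55, 45, 45, 55),
    (70, 30, 60, 40),
    (85, 15, 75, 25),
]

num = den = 0.0
std_num = std_den = 0.0
for a, b, c, d in strata:
    n = a + b + c + d
    num += a * d / n                        # common odds-ratio numerator
    den += b * c / n                        # common odds-ratio denominator
    nf = c + d                              # focal-group size at this level
    std_num += nf * (c / nf - a / (a + b))  # focal minus reference p-value
    std_den += nf

alpha_mh = num / den
mh_d_dif = -2.35 * math.log(alpha_mh)       # ETS delta metric
std_p_dif = std_num / std_den
print(round(mh_d_dif, 3), round(std_p_dif, 3))
```

Negative values on both indices indicate the item is relatively harder for the focal group after conditioning on total score.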
Peer reviewed
PDF on ERIC
Mousavi, Amin; Cui, Ying – Education Sciences, 2020
Often, important decisions regarding accountability and placement of students in performance categories are made on the basis of test scores; therefore, it is important to evaluate the validity of the inferences derived from test results. One of the threats to the validity of such inferences is aberrant responding. Several…
Descriptors: Student Evaluation, Educational Testing, Psychological Testing, Item Response Theory
Peer reviewed
Direct link
Veldkamp, Bernard P. – Journal of Educational Measurement, 2016
Many standardized tests are now administered via computer rather than paper-and-pencil format. The computer-based delivery mode brings with it certain advantages. One advantage is the ability to adapt the difficulty level of the test to the ability level of the test taker in what has been termed computerized adaptive testing (CAT). A second…
Descriptors: Computer Assisted Testing, Reaction Time, Standardized Tests, Difficulty Level
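To make the CAT idea concrete, here is a hedged sketch of one common recipe (not necessarily the procedure studied in the article): under the Rasch model, administer the unused item with maximal Fisher information at the current ability estimate, then re-estimate ability by its posterior mean (EAP) under a standard-normal prior. Item difficulties and the response pattern are hypothetical:

```python
import math

def p_correct(theta, b):
    """Rasch model: probability of a correct answer to an item of difficulty b."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

def information(theta, b):
    """Fisher information of a Rasch item at ability theta: p(1 - p)."""
    p = p_correct(theta, b)
    return p * (1.0 - p)

def eap(items, resps):
    """Posterior-mean (EAP) ability under a N(0, 1) prior, by simple quadrature."""
    num = den = 0.0
    for k in range(81):
        t = -4.0 + 0.1 * k
        w = math.exp(-0.5 * t * t)              # prior weight
        for b, x in zip(items, resps):
            p = p_correct(t, b)
            w *= p if x else (1.0 - p)          # likelihood of the pattern
        num += t * w
        den += w
    return num / den

pool = [-2.0, -1.0, -0.5, 0.0, 0.5, 1.0, 2.0]   # hypothetical difficulties
administered, resps, theta = [], [], 0.0
for x in [1, 1, 0, 1, 0]:                       # simulated answer pattern
    b = max((d for d in pool if d not in administered),
            key=lambda d: information(theta, d)) # most informative unused item
    administered.append(b)
    resps.append(x)
    theta = eap(administered, resps)            # re-estimate after each answer
print(round(theta, 2))
```

Each item is chosen where the current estimate is least certain, which is the sense in which the test "adapts" its difficulty to the test taker.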
Zheng, Chunmei – ProQuest LLC, 2013
Educational and psychological constructs are normally measured by multifaceted dimensions. The measured construct is defined and measured by a set of related subdomains. A bifactor model can accurately describe such data with both the measured construct and the related subdomains. However, a limitation of the bifactor model is the orthogonality…
Descriptors: Educational Testing, Measurement Techniques, Test Items, Models
Kim, Jihye – ProQuest LLC, 2010
In DIF studies, a Type I error refers to the mistake of identifying non-DIF items as DIF items, and a Type I error rate refers to the proportion of Type I errors in a simulation study. The possibility of making a Type I error in DIF studies is always present, and a high probability of making such an error can weaken the validity of the assessment.…
Descriptors: Test Bias, Test Length, Simulation, Testing
Evans, Josiah Jeremiah – ProQuest LLC, 2010
In measurement research, data simulations are a commonly used analytical technique. While simulation designs have many benefits, it is unclear if these artificially generated datasets are able to accurately capture real examinee item response behaviors. This potential lack of comparability may have important implications for administration of…
Descriptors: Computer Assisted Testing, Adaptive Testing, Educational Testing, Admission (School)
Kim, Jiseon – ProQuest LLC, 2010
Classification testing has been widely used to make categorical decisions by determining whether an examinee has a certain degree of ability required by established standards. As computer technologies have developed, classification testing has become more computerized. Several approaches have been proposed and investigated in the context of…
Descriptors: Test Length, Computer Assisted Testing, Classification, Probability
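One classic computerized classification approach is Wald's sequential probability ratio test (SPRT): keep administering items until the log-likelihood ratio between a "master" and a "non-master" ability level crosses a decision bound. A minimal sketch under the Rasch model (the cut points and error rates below are illustrative, not taken from the dissertation):

```python
import math

def p(theta, b):
    """Rasch probability of a correct response."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

def sprt_classify(responses, difficulties, theta0=-0.5, theta1=0.5,
                  alpha=0.05, beta=0.05):
    """Wald's SPRT: return 'master', 'non-master', or None (keep testing)."""
    upper = math.log((1 - beta) / alpha)   # accept 'master' above this
    lower = math.log(beta / (1 - alpha))   # accept 'non-master' below this
    llr = 0.0
    for x, b in zip(responses, difficulties):
        p1, p0 = p(theta1, b), p(theta0, b)
        llr += math.log(p1 / p0) if x else math.log((1 - p1) / (1 - p0))
        if llr >= upper:
            return "master"
        if llr <= lower:
            return "non-master"
    return None  # undecided after the available items

print(sprt_classify([1] * 10, [0.0] * 10))
```

With these bounds a run of six correct (or six incorrect) answers on items of difficulty 0 is already decisive, which is why sequential designs can classify with far fewer items than fixed-length tests.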
Pennsylvania Department of Education, 2010
This handbook describes the responsibilities of district and school assessment coordinators in the administration of the Pennsylvania System of School Assessment (PSSA). This updated guidebook contains the following sections: (1) General Assessment Guidelines for All Assessments; (2) Writing Specific Guidelines; (3) Reading and Mathematics…
Descriptors: Guidelines, Guides, Educational Assessment, Writing Tests
Peer reviewed
Direct link
Cui, Ying; Leighton, Jacqueline P. – Journal of Educational Measurement, 2009
In this article, we introduce a person-fit statistic called the hierarchy consistency index (HCI) to help detect misfitting item response vectors for tests developed and analyzed based on a cognitive model. The HCI ranges from -1.0 to 1.0, with values close to -1.0 indicating that students respond unexpectedly or differently from the responses…
Descriptors: Test Length, Simulation, Correlation, Research Methodology
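The exact HCI formula is not reproduced in the abstract, so the sketch below implements only a simplified consistency score in its spirit: count responses that violate an assumed item hierarchy (a dependent item answered correctly while its prerequisite is answered incorrectly) and scale the count into the [-1, 1] range. The item names and prerequisite pairs are invented:

```python
def consistency_index(resp, prereq):
    """
    Simplified hierarchy-consistency score (illustrative; not the published
    HCI formula). resp maps item -> 0/1; prereq lists (prerequisite,
    dependent) pairs. A misfit is a dependent item answered correctly while
    its prerequisite is answered incorrectly. Returns a value in [-1, 1];
    1 means fully consistent with the hierarchy.
    """
    misfits = sum(1 for a, b in prereq if resp[b] == 1 and resp[a] == 0)
    return 1.0 - 2.0 * misfits / len(prereq)

pairs = [("add", "multiply"), ("multiply", "exponent"), ("add", "exponent")]
good = {"add": 1, "multiply": 1, "exponent": 0}   # expected response pattern
odd = {"add": 0, "multiply": 0, "exponent": 1}    # aberrant pattern
print(consistency_index(good, pairs), round(consistency_index(odd, pairs), 3))
```

The aberrant pattern scores near -1 because it succeeds on a skill whose assumed prerequisites it fails, which is the kind of misfitting response vector such indices flag.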
Kang, Taehoon; Chen, Troy T. – ACT, Inc., 2007
Orlando and Thissen (2000, 2003) proposed an item-fit index, S-X[superscript 2], for dichotomous item response theory (IRT) models, which has performed better than traditional item-fit statistics such as Yen's (1981) Q[subscript 1] and McKinley and Mill's (1985) G[superscript 2]. This study extends the utility of S-X[superscript 2] to polytomous…
Descriptors: Item Response Theory, Models, Computer Software, Statistical Analysis
Rhode Island Department of Elementary and Secondary Education, 2007
This handbook will assist principals and school testing coordinators in implementing the spring 2007 administration of the Developmental Reading Assessment (DRA). Information regarding administration timeline, reporting, process, online tools and contact personnel is discussed. Contents include: (1) Scheduling; (2) Identify Primary Test…
Descriptors: Testing Accommodations, Alternative Assessment, Educational Testing, Guidance Programs
Peer reviewed
Van Der Linden, Wim J. – Educational and Psychological Measurement, 1983
This paper focuses on mixtures of two binomials with one known success parameter. It is shown how moment estimators can be obtained for the remaining unknown parameters of such mixtures, and results are presented from a Monte Carlo study carried out to explore the statistical properties of these estimators. (PN)
Descriptors: Educational Testing, Error of Measurement, Estimation (Mathematics), Guessing (Tests)
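The abstract's setting, a two-component binomial mixture with one known success parameter estimated by the method of moments, can be illustrated numerically. The sketch below matches the first sample moment exactly and scans for the mixing weight that best matches the second; all parameter values are invented, and the grid scan is just one convenient way to solve the moment equations:

```python
import random

random.seed(1)
n, c = 20, 0.25          # items; known success rate of the "guessing" component
pi_true, p_true = 0.3, 0.8

def draw():
    """One number-correct score from the two-component binomial mixture."""
    q = c if random.random() < pi_true else p_true
    return sum(random.random() < q for _ in range(n))

data = [draw() for _ in range(5000)]
m1 = sum(data) / len(data)
m2 = sum(x * x for x in data) / len(data)
mu = m1 / n                                  # mean proportion correct

def predicted_m2(a):
    """Second moment implied by mixing weight a, holding the first moment fixed."""
    q = (mu - a * c) / (1.0 - a)             # implied second success parameter
    if not 0.0 <= q <= 1.0:
        return float("inf")
    return (a * (n * c * (1 - c) + (n * c) ** 2)
            + (1 - a) * (n * q * (1 - q) + (n * q) ** 2))

# Moment estimate: the mixing weight that best matches the second moment.
pi_hat = min((a / 1000 for a in range(1, 1000)),
             key=lambda a: abs(predicted_m2(a) - m2))
p_hat = (mu - pi_hat * c) / (1.0 - pi_hat)
print(round(pi_hat, 2), round(p_hat, 2))
```

With 5,000 simulated scores the moment estimates land close to the generating values, which is the kind of Monte Carlo check the report describes.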
PDF pending restoration
Gilmer, Jerry S.; Feldt, Leonard S. – 1982
The Feldt-Gilmer congeneric reliability coefficients make it possible to estimate the reliability of a test composed of parts of unequal, unknown length. The approximate standard errors of the Feldt-Gilmer coefficients are derived via a method using the multivariate Taylor's expansion. Monte Carlo simulation is employed to corroborate the…
Descriptors: Educational Testing, Error of Measurement, Mathematical Formulas, Mathematical Models
Hills, John R. – 1979
Six experimental approaches to the problems of setting cutoff scores and choosing proper test length are briefly mentioned. Most of these methods share the premise that a test is a random sample of items, from a domain associated with a carefully specified objective. Each item is independent and is scored zero or one, with no provision for…
Descriptors: Academic Standards, Aptitude Treatment Interaction, Criterion Referenced Tests, Cutting Scores
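The shared premise, a test as a random sample of independent items scored zero or one, is the binomial error model, under which the chance of clearing a cutoff is a binomial tail probability. A small sketch with a hypothetical 16-out-of-20 passing standard:

```python
from math import comb

def pass_prob(p, n, cutoff):
    """P(number correct >= cutoff) on an n-item test when each item is
    answered correctly with domain proportion p (binomial error model)."""
    return sum(comb(n, x) * p**x * (1 - p)**(n - x)
               for x in range(cutoff, n + 1))

# Hypothetical standard: mastery means knowing 80% of the domain; cutoff 16/20.
n, cutoff = 20, 16
for p in (0.70, 0.80, 0.90):
    print(p, round(pass_prob(p, n, cutoff), 3))
```

Tabulating these tail probabilities for candidate cutoffs and test lengths is one direct way to study the misclassification trade-offs the approaches above address.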
van der Linden, Wim J. – Evaluation in Education: International Progress, 1982
In mastery testing, a linear relationship between an optimal passing score and test length is presented, based on a new optimization criterion. The usual indifference zone approach, a binomial error model, decision errors, and corrections for guessing are discussed. Related results in sequential testing and the latent class approach are included. (CM)
Descriptors: Cutting Scores, Educational Testing, Mastery Tests, Mathematical Models