ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	4
Since 2016 (last 10 years)	6
Since 2006 (last 20 years)	10

Descriptor

Guidelines	14
Test Length	14
Test Items	9
Simulation	6
Comparative Analysis	5
Sample Size	5
Item Response Theory	4
Student Evaluation	4
Test Format	4
Educational Testing	3
Error of Measurement	3
Item Analysis	3
Behavioral Objectives	2
Computer Assisted Testing	2
Correlation	2
Criterion Referenced Tests	2
Difficulty Level	2
Effect Size	2
Equated Scores	2
Evaluation Methods	2
Language Tests	2
Mastery Tests	2
Models	2
Probability	2
Sampling	2
More ▼

Source

Educational and Psychological…	3
Journal of Educational…	2
Language Testing	2
PEPNet-Northeast	1
Pennsylvania Department of…	1
ProQuest LLC	1
Quality Assurance in…	1

Publication Type

Journal Articles	8
Reports - Research	7
Guides - Non-Classroom	2
Reports - Descriptive	2
Reports - Evaluative	2
Dissertations/Theses -…	1
Reference Materials -…	1
Speeches/Meeting Papers	1

Education Level

Elementary Secondary Education	1
Higher Education	1
Postsecondary Education	1

Audience

Community	1
Practitioners	1
Researchers	1

Location

Japan	1
Pennsylvania	1

Laws, Policies, & Programs

Americans with Disabilities…	1
Equal Access	1
Job Training Partnership Act…	1
Rehabilitation Act 1973…	1

Assessments and Surveys

ACTFL Oral Proficiency…

What Works Clearinghouse Rating

Showing all 14 results Save | Export

Assessing Dimensionality of IRT Models Using Traditional and Revised Parallel Analyses

Peer reviewed

Direct link

Guo, Wenjing; Choi, Youn-Jeng – Educational and Psychological Measurement, 2023

Determining the number of dimensions is extremely important in applying item response theory (IRT) models to data. Traditional and revised parallel analyses have been proposed within the factor analysis framework, and both have shown some promise in assessing dimensionality. However, their performance in the IRT framework has not been…

Descriptors: Item Response Theory, Evaluation Methods, Factor Analysis, Guidelines

Two IRT Characteristic Curve Linking Methods Weighted by Information

Peer reviewed

Direct link

Wang, Shaojie; Zhang, Minqiang; Lee, Won-Chan; Huang, Feifei; Li, Zonglong; Li, Yixing; Yu, Sufang – Journal of Educational Measurement, 2022

Traditional IRT characteristic curve linking methods ignore parameter estimation errors, which may undermine the accuracy of estimated linking constants. Two new linking methods are proposed that take into account parameter estimation errors. The item- (IWCC) and test-information-weighted characteristic curve (TWCC) methods employ weighting…

Descriptors: Item Response Theory, Error of Measurement, Accuracy, Monte Carlo Methods

Closed Formula of Test Length Required for Adaptive Testing with Medium Probability of Solution

Peer reviewed

Direct link

Kárász, Judit T.; Széll, Krisztián; Takács, Szabolcs – Quality Assurance in Education: An International Perspective, 2023

Purpose: Based on the general formula, which depends on the length and difficulty of the test, the number of respondents and the number of ability levels, this study aims to provide a closed formula for the adaptive tests with medium difficulty (probability of solution is p = 1/2) to determine the accuracy of the parameters for each item and in…

Descriptors: Test Length, Probability, Comparative Analysis, Difficulty Level

A Regression Discontinuity Design Framework for Controlling Selection Bias in Evaluations of Differential Item Functioning

Peer reviewed

Direct link

Koziol, Natalie A.; Goodrich, J. Marc; Yoon, HyeonJin – Educational and Psychological Measurement, 2022

Differential item functioning (DIF) is often used to examine validity evidence of alternate form test accommodations. Unfortunately, traditional approaches for evaluating DIF are prone to selection bias. This article proposes a novel DIF framework that capitalizes on regression discontinuity design analysis to control for selection bias. A…

Descriptors: Regression (Statistics), Item Analysis, Validity, Testing Accommodations

ACTFL Oral Proficiency Interview -- Computer (OPIc)

Peer reviewed

Direct link

Isbell, Dan; Winke, Paula – Language Testing, 2019

The American Council on the Teaching of Foreign Languages (ACTFL) oral proficiency interview -- computer (OPIc) testing system represents an ambitious effort in language assessment: Assessing oral proficiency in over a dozen languages, on the same scale, from virtually anywhere at any time. Especially for users in contexts where multiple foreign…

Descriptors: Oral Language, Language Tests, Language Proficiency, Second Language Learning

Asymptotic Standard Errors of Observed-Score Equating with Polytomous IRT Models

Peer reviewed

Direct link

Andersson, Björn – Journal of Educational Measurement, 2016

In observed-score equipercentile equating, the goal is to make scores on two scales or tests measuring the same construct comparable by matching the percentiles of the respective score distributions. If the tests consist of different items with multiple categories for each item, a suitable model for the responses is a polytomous item response…

Descriptors: Equated Scores, Item Response Theory, Error of Measurement, Tests

Establishing Effect Size Guidelines for Interpreting the Results of Differential Bundle Functioning Analyses Using SIBTEST

Peer reviewed

Direct link

Walker, Cindy M.; Zhang, Bo; Banks, Kathleen; Cappaert, Kevin – Educational and Psychological Measurement, 2012

The purpose of this simulation study was to establish general effect size guidelines for interpreting the results of differential bundle functioning (DBF) analyses using simultaneous item bias test (SIBTEST). Three factors were manipulated: number of items in a bundle, test length, and magnitude of uniform differential item functioning (DIF)…

Descriptors: Test Bias, Test Length, Simulation, Guidelines

Conditions Affecting the Accuracy of Classical Equating Methods for Small Samples under the NEAT Design: A Simulation Study

Direct link

Sunnassee, Devdass – ProQuest LLC, 2011

Small sample equating remains a largely unexplored area of research. This study attempts to fill in some of the research gaps via a large-scale, IRT-based simulation study that evaluates the performance of seven small-sample equating methods under various test characteristic and sampling conditions. The equating methods considered are typically…

Descriptors: Test Length, Test Format, Sample Size, Simulation

The National Center Test for University Admissions

Peer reviewed

Direct link

Watanabe, Yoshinori – Language Testing, 2013

This article describes the National Center Test for University Admissions, a unified national test in Japan, which is taken by 500,000 students every year. It states that implementation of the Center Test began in 1990, with the English component consisting only of the written section until 2005, when the listening section was first implemented…

Descriptors: College Admission, Foreign Countries, College Entrance Examinations, English (Second Language)

The 2008-2009 Pennsylvania System of School Assessment Handbook for Assessment Coordinators: Writing, Reading and Mathematics, Science

Download full text

Pennsylvania Department of Education, 2010

This handbook describes the responsibilities of district and school assessment coordinators in the administration of the Pennsylvania System of School Assessment (PSSA). This updated guidebook contains the following sections: (1) General Assessment Guidelines for All Assessments; (2) Writing Specific Guidelines; (3) Reading and Mathematics…

Descriptors: Guidelines, Guides, Educational Assessment, Writing Tests

Some Guidelines for Determining the Length of Objectives-Based Criterion-Referenced Tests.

Berk, Ronald A. – 1979

Four factors essential to determining how many items should be constructed or sampled for a set of objectives are examined: (1) importance and type of decisions to be made with the results; (2) importance and emphases assigned to the instructional and behavioral objectives; (3) number of objectives; (4) practical constraints, such as item writing…

Descriptors: Behavioral Objectives, Course Objectives, Criterion Referenced Tests, Decision Making

Providing Testing Accommodations for Deaf and Hard of Hearing Students. PEPNet Tipsheet

Direct link

Buchkoski, David, Comp. – PEPNet-Northeast, 1999

As the number of deaf and hard of hearing (d/hh) students seeking enrollment in postsecondary education programs increases, the accommodations they request to ensure equal access also increases. Most accommodations, such as interpreters, note-takers, and assistive listening devices are obvious and seldom questioned. D/HH students requesting…

Descriptors: Partial Hearing, Deafness, Educational Testing, Special Education

Prescribing Test Length for Criterion-Referenced Measurement. I. Posttests. ACT Technical Bulletin No. 18.

Download full text

Novick, Melvin R.; Lewis, Charles – 1974

In a program of Individually Prescribed Instruction (IPI), where a student's progress through each level of a program of study is governed by his performance on a test dealing with individual behavioral objectives, there is considerable value in keeping the number of items on each test at a minimum. The specified test length for each objective…

Descriptors: Behavioral Objectives, Criterion Referenced Tests, Cutting Scores, Elementary Secondary Education

Vocational Assessment Instruments Reference Guide. A Review of Interest, Aptitude & Pre-Employment/Job Readiness Tests.

Download full text

New York State Div. for Youth, Albany. – 1985

This guide is designed to serve as a reference to assist providers of Job Training Partnership Act-funded programs in selecting appropriate interest, aptitude, and pre-employment and job readiness tests. Descriptions of 53 interest tests, 38 aptitude tests, and 37 pre-employment and job readiness tests are provided. Each description contains…

Descriptors: Aptitude Tests, Employment Potential, Evaluation Criteria, Guidelines

Andersson, Björn	1
Banks, Kathleen	1
Berk, Ronald A.	1
Buchkoski, David, Comp.	1
Cappaert, Kevin	1
Choi, Youn-Jeng	1
Goodrich, J. Marc	1
Guo, Wenjing	1
Huang, Feifei	1
Isbell, Dan	1
Koziol, Natalie A.	1
Kárász, Judit T.	1
Lee, Won-Chan	1
Lewis, Charles	1
Li, Yixing	1
Li, Zonglong	1
Novick, Melvin R.	1
Sunnassee, Devdass	1
Széll, Krisztián	1
Takács, Szabolcs	1
Walker, Cindy M.	1
Wang, Shaojie	1
Watanabe, Yoshinori	1
Winke, Paula	1
Yoon, HyeonJin	1
More ▼