ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	4
Since 2006 (last 20 years)	9

Descriptor

Test Length	9
Testing Problems	9
Test Format	4
Comparative Analysis	3
Evaluation Problems	3
Item Response Theory	3
Test Items	3
Alternative Assessment	2
Computer Assisted Testing	2
Educational Testing	2
Evaluation Methods	2
Evaluation Research	2
Goodness of Fit	2
Guidelines	2
Item Analysis	2
Language Tests	2
Reading Tests	2
Sample Size	2
Second Language Learning	2
Simulation	2
Test Content	2
Test Reliability	2
Test Validity	2
Achievement Tests	1
Barriers	1
More ▼

Source

Applied Measurement in…	2
Language Testing	2
Behavioral Research and…	1
Educational Research and…	1
Journal of Educational…	1
Participatory Educational…	1
Rhode Island Department of…	1

Author

Alonzo, Julie	1
Camilli, Gregory	1
Cui, Ying	1
Isbell, Dan	1
Kahn, Josh	1
Kiliç, Abdullah Faruk	1
Leighton, Jacqueline P.	1
Nese, Joseph T.	1
Sahin-Kürsad, Merve	1
Sinharay, Sandip	1
Uysal, Ibrahim	1
Watanabe, Yoshinori	1
Winke, Paula	1
Wollack, James A.	1
More ▼

Publication Type

Journal Articles	7
Reports - Evaluative	5
Reports - Research	3
Guides - Non-Classroom	1
Numerical/Quantitative Data	1
Tests/Questionnaires	1

Education Level

Elementary Education	2
Early Childhood Education	1
Elementary Secondary Education	1
Grade 2	1
Grade 3	1
Grade 4	1
Higher Education	1
Intermediate Grades	1
Postsecondary Education	1
Primary Education	1

Audience

Practitioners

Location

Japan	1
Rhode Island	1

Laws, Policies, & Programs

Assessments and Surveys

ACTFL Oral Proficiency…

What Works Clearinghouse Rating

Showing all 9 results Save | Export

Effect of Item Parameter Drift in Mixed Format Common Items on Test Equating

Peer reviewed
PDF on ERIC

Download full text

Uysal, Ibrahim; Sahin-Kürsad, Merve; Kiliç, Abdullah Faruk – Participatory Educational Research, 2022

The aim of the study was to examine the common items in the mixed format (e.g., multiple-choices and essay items) contain parameter drifts in the test equating processes performed with the common item nonequivalent groups design. In this study, which was carried out using Monte Carlo simulation with a fully crossed design, the factors of test…

Descriptors: Test Items, Test Format, Item Response Theory, Equated Scores

Are the Nonparametric Person-Fit Statistics More Powerful than Their Parametric Counterparts? Revisiting the Simulations in Karabatsos (2003)

Peer reviewed

Direct link

Sinharay, Sandip – Applied Measurement in Education, 2017

Karabatsos compared the power of 36 person-fit statistics using receiver operating characteristics curves and found the "H[superscript T]" statistic to be the most powerful in identifying aberrant examinees. He found three statistics, "C", "MCI", and "U3", to be the next most powerful. These four statistics,…

Descriptors: Nonparametric Statistics, Goodness of Fit, Simulation, Comparative Analysis

ACTFL Oral Proficiency Interview -- Computer (OPIc)

Peer reviewed

Direct link

Isbell, Dan; Winke, Paula – Language Testing, 2019

The American Council on the Teaching of Foreign Languages (ACTFL) oral proficiency interview -- computer (OPIc) testing system represents an ambitious effort in language assessment: Assessing oral proficiency in over a dozen languages, on the same scale, from virtually anywhere at any time. Especially for users in contexts where multiple foreign…

Descriptors: Oral Language, Language Tests, Language Proficiency, Second Language Learning

Teacher Survey of the Accessibility and Text Features of the Computerized Oral Reading Evaluation (CORE). Technical Report #1601

Download full text

Kahn, Josh; Nese, Joseph T.; Alonzo, Julie – Behavioral Research and Teaching, 2016

There is strong theoretical support for oral reading fluency (ORF) as an essential building block of reading proficiency. The current and standard ORF assessment procedure requires that students read aloud a grade-level passage (˜ 250 words) in a one-to-one administration, with the number of words read correctly in 60 seconds constituting their…

Descriptors: Teacher Surveys, Oral Reading, Reading Tests, Computer Assisted Testing

The National Center Test for University Admissions

Peer reviewed

Direct link

Watanabe, Yoshinori – Language Testing, 2013

This article describes the National Center Test for University Admissions, a unified national test in Japan, which is taken by 500,000 students every year. It states that implementation of the Center Test began in 1990, with the English component consisting only of the written section until 2005, when the listening section was first implemented…

Descriptors: College Admission, Foreign Countries, College Entrance Examinations, English (Second Language)

Ongoing Issues in Test Fairness

Peer reviewed

Direct link

Camilli, Gregory – Educational Research and Evaluation, 2013

In the attempt to identify or prevent unfair tests, both quantitative analyses and logical evaluation are often used. For the most part, fairness evaluation is a pragmatic attempt at determining whether procedural or substantive due process has been accorded to either a group of test takers or an individual. In both the individual and comparative…

Descriptors: Alternative Assessment, Test Bias, Test Content, Test Format

The Hierarchy Consistency Index: Evaluating Person Fit for Cognitive Diagnostic Assessment

Peer reviewed

Direct link

Cui, Ying; Leighton, Jacqueline P. – Journal of Educational Measurement, 2009

In this article, we introduce a person-fit statistic called the hierarchy consistency index (HCI) to help detect misfitting item response vectors for tests developed and analyzed based on a cognitive model. The HCI ranges from -1.0 to 1.0, with values close to -1.0 indicating that students respond unexpectedly or differently from the responses…

Descriptors: Test Length, Simulation, Correlation, Research Methodology

Simultaneous Use of Multiple Answer Copying Indexes to Improve Detection Rates

Peer reviewed

Direct link

Wollack, James A. – Applied Measurement in Education, 2006

Many of the currently available statistical indexes to detect answer copying lack sufficient power at small [alpha] levels or when the amount of copying is relatively small. Furthermore, there is no one index that is uniformly best. Depending on the type or amount of copying, certain indexes are better than others. The purpose of this article was…

Descriptors: Statistical Analysis, Item Analysis, Test Length, Sample Size

Rhode Island State Assessment Program District and School Testing Coordinators Handbook: K-1 Assessment Program

Download full text

Rhode Island Department of Elementary and Secondary Education, 2007

This handbook will assist principals and school testing coordinators in implementing the spring 2007 administration of the Developmental Reading Assessment (DRA). Information regarding administration timeline, reporting, process, online tools and contact personnel is discussed. Contents include: (1) Scheduling; (2) Identify Primary Test…

Descriptors: Testing Accommodations, Alternative Assessment, Educational Testing, Guidance Programs