Publication Date
In 2025 | 0
Since 2024 | 0
Since 2021 (last 5 years) | 1
Since 2016 (last 10 years) | 4
Since 2006 (last 20 years) | 9
Descriptor
Test Length | 54
Testing Problems | 54
Test Construction | 20
Test Items | 18
Test Validity | 17
Test Format | 14
Test Reliability | 13
Item Analysis | 10
Multiple Choice Tests | 10
Computer Assisted Testing | 9
Mathematical Models | 9
Author
Wainer, Howard | 2
Wilcox, Rand R. | 2
van der Linden, Wim J. | 2
Alonzo, Julie | 1
Axelrod, Bradley N. | 1
Bay, Luz | 1
Bergstrom, Betty | 1
Boyd, Thomas A. | 1
Brennan, Robert L. | 1
Budescu, David | 1
Camilli, Gregory | 1
Education Level
Elementary Education | 2
Early Childhood Education | 1
Elementary Secondary Education | 1
Grade 2 | 1
Grade 3 | 1
Grade 4 | 1
Higher Education | 1
Intermediate Grades | 1
Postsecondary Education | 1
Primary Education | 1
Audience
Researchers | 6
Practitioners | 1
Location
Japan | 1
New Jersey | 1
Rhode Island | 1
United Kingdom | 1
Vermont | 1
Uysal, Ibrahim; Sahin-Kürsad, Merve; Kiliç, Abdullah Faruk – Participatory Educational Research, 2022
The aim of the study was to examine whether the common items in mixed-format tests (e.g., multiple-choice and essay items) contain parameter drift in test equating performed with the common-item nonequivalent groups design. In this study, which was carried out using a Monte Carlo simulation with a fully crossed design, the factors of test…
Descriptors: Test Items, Test Format, Item Response Theory, Equated Scores
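The abstract above concerns parameter drift in common items. As a minimal, hypothetical screening sketch (not the study's Monte Carlo design), drifting items can be flagged by comparing difficulty estimates across the two administrations; the 0.5-logit threshold is an assumed illustration, not a value from the paper:

```python
def flag_drifting_items(b_old, b_new, threshold=0.5):
    """Return indices of common items whose difficulty estimates shifted
    between administrations by more than `threshold` logits.

    Illustrative screening rule only; real drift detection typically uses
    standardized differences or dedicated statistics.
    """
    return [i for i, (bo, bn) in enumerate(zip(b_old, b_new))
            if abs(bn - bo) > threshold]

# Item 2 drifted by 0.8 logits; the other items are stable.
print(flag_drifting_items([-1.0, 0.2, 1.1], [-0.9, 0.3, 1.9]))  # → [2]
```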
Sinharay, Sandip – Applied Measurement in Education, 2017
Karabatsos compared the power of 36 person-fit statistics using receiver operating characteristic curves and found the "H^T" statistic to be the most powerful in identifying aberrant examinees. He found three statistics, "C", "MCI", and "U3", to be the next most powerful. These four statistics,…
Descriptors: Nonparametric Statistics, Goodness of Fit, Simulation, Comparative Analysis
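As a hedged illustration of the ROC-based power comparison mentioned above (not the study's code, and with made-up toy scores), the area under the ROC curve for a person-fit statistic can be estimated directly from its values on normal and aberrant examinees:

```python
def roc_auc(normal_scores, aberrant_scores):
    """Mann-Whitney estimate of the area under the ROC curve: the
    probability that a randomly chosen aberrant examinee receives a
    higher (more suspicious) statistic value than a randomly chosen
    normal examinee, with ties counted as one half."""
    wins = 0.0
    for a in aberrant_scores:
        for n in normal_scores:
            if a > n:
                wins += 1.0
            elif a == n:
                wins += 0.5
    return wins / (len(aberrant_scores) * len(normal_scores))

# Toy data: the statistic tends to be larger for aberrant examinees.
print(roc_auc([0.1, 0.2, 0.3], [0.25, 0.4, 0.5]))
```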
Isbell, Dan; Winke, Paula – Language Testing, 2019
The American Council on the Teaching of Foreign Languages (ACTFL) Oral Proficiency Interview - computer (OPIc) testing system represents an ambitious effort in language assessment: assessing oral proficiency in over a dozen languages, on the same scale, from virtually anywhere at any time. Especially for users in contexts where multiple foreign…
Descriptors: Oral Language, Language Tests, Language Proficiency, Second Language Learning
Kahn, Josh; Nese, Joseph T.; Alonzo, Julie – Behavioral Research and Teaching, 2016
There is strong theoretical support for oral reading fluency (ORF) as an essential building block of reading proficiency. The current and standard ORF assessment procedure requires that students read aloud a grade-level passage (~250 words) in a one-to-one administration, with the number of words read correctly in 60 seconds constituting their…
Descriptors: Teacher Surveys, Oral Reading, Reading Tests, Computer Assisted Testing
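The scoring rule described in the abstract can be sketched as follows (a generic words-correct-per-minute computation, not the authors' instrument; the example numbers are hypothetical):

```python
def words_correct_per_minute(words_read: int, errors: int, seconds: int = 60) -> float:
    """Standard words-correct-per-minute (WCPM) score: words read
    correctly, rescaled to a one-minute rate."""
    return (words_read - errors) * 60.0 / seconds

# A student who reads 148 words with 6 errors in 60 seconds:
print(words_correct_per_minute(148, 6))  # → 142.0
```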
Watanabe, Yoshinori – Language Testing, 2013
This article describes the National Center Test for University Admissions, a unified national test in Japan, which is taken by 500,000 students every year. It states that implementation of the Center Test began in 1990, with the English component consisting only of the written section until 2005, when the listening section was first implemented…
Descriptors: College Admission, Foreign Countries, College Entrance Examinations, English (Second Language)
Camilli, Gregory – Educational Research and Evaluation, 2013
In the attempt to identify or prevent unfair tests, both quantitative analyses and logical evaluation are often used. For the most part, fairness evaluation is a pragmatic attempt at determining whether procedural or substantive due process has been accorded to either a group of test takers or an individual. In both the individual and comparative…
Descriptors: Alternative Assessment, Test Bias, Test Content, Test Format
Cui, Ying; Leighton, Jacqueline P. – Journal of Educational Measurement, 2009
In this article, we introduce a person-fit statistic called the hierarchy consistency index (HCI) to help detect misfitting item response vectors for tests developed and analyzed based on a cognitive model. The HCI ranges from -1.0 to 1.0, with values close to -1.0 indicating that students respond unexpectedly or differently from the responses…
Descriptors: Test Length, Simulation, Correlation, Research Methodology
Wollack, James A. – Applied Measurement in Education, 2006
Many of the currently available statistical indexes to detect answer copying lack sufficient power at small α levels or when the amount of copying is relatively small. Furthermore, there is no one index that is uniformly best. Depending on the type or amount of copying, certain indexes are better than others. The purpose of this article was…
Descriptors: Statistical Analysis, Item Analysis, Test Length, Sample Size
Rhode Island Department of Elementary and Secondary Education, 2007
This handbook will assist principals and school testing coordinators in implementing the spring 2007 administration of the Developmental Reading Assessment (DRA). Information regarding administration timeline, reporting, process, online tools and contact personnel is discussed. Contents include: (1) Scheduling; (2) Identify Primary Test…
Descriptors: Testing Accommodations, Alternative Assessment, Educational Testing, Guidance Programs
Bay, Luz – 1995
An index is proposed to detect cheating on multiple-choice examinations, and its use is evaluated through simulations. The proposed index is based on the compound binomial distribution. In total, 360 simulated data sets reflecting 12 different cheating (copying) situations were obtained and used for the study of the sensitivity of the index in…
Descriptors: Cheating, Class Size, Identification, Multiple Choice Tests
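The abstract does not give the index's formula. As a hedged sketch of the general idea behind binomial-type copying indices, one can compute the chance probability of observing many identical responses; note the study's index is built on the compound binomial, which lets the match probability vary across items, unlike this constant-probability simplification:

```python
from math import comb

def binomial_match_pvalue(n_items: int, k_matches: int, p_match: float) -> float:
    """P(at least k_matches identical responses out of n_items) under a
    plain binomial chance model with a constant per-item match
    probability. Illustrative simplification only."""
    return sum(comb(n_items, j) * p_match**j * (1 - p_match)**(n_items - j)
               for j in range(k_matches, n_items + 1))

# Ten identical responses on 20 items is very unlikely at a 0.2 chance rate.
print(binomial_match_pvalue(20, 10, 0.2))
```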
Conger, Anthony J. – Educational and Psychological Measurement, 1983
A paradoxical phenomenon of decreases in reliability as the number of elements averaged over increases is shown to be possible in multifacet reliability procedures (intraclass correlations or generalizability coefficients). Conditions governing this phenomenon are presented along with implications and cautions. (Author)
Descriptors: Generalizability Theory, Test Construction, Test Items, Test Length
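For context (an illustrative sketch, not the paper's derivation), the classical single-facet Spearman-Brown formula predicts that reliability rises as more parallel elements are averaged; the paradox described above is that multifacet designs can reverse this expectation:

```python
def spearman_brown(rho: float, k: int) -> float:
    """Reliability of the average of k parallel measurements, each with
    reliability rho (the single-facet prophecy formula)."""
    return k * rho / (1 + (k - 1) * rho)

# Doubling the number of parallel elements raises reliability from .50 to .67.
print(round(spearman_brown(0.5, 2), 2))  # → 0.67
```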
Modjeski, Richard B.; Michael, William B. – Educational and Psychological Measurement, 1978
The General Education Performance Index (GEPI) is a comparatively short test covering the same content as the General Educational Development (GED) test, which takes ten hours to administer. Correlations of the subtests of the GEPI with the GED ranged from .28 to .57. (JKS)
Descriptors: Correlation, Equivalency Tests, Military Personnel, Statistical Data
Brennan, Robert L. – 1990
In 1955, R. Levine introduced two linear equating procedures for the common-item non-equivalent populations design. His procedures make the same assumptions about true scores; they differ in terms of the nature of the equating function used. In this paper, two parameterizations of a classical congeneric model are introduced to model the variables…
Descriptors: Equated Scores, Equations (Mathematics), Mathematical Models, Research Design
Gallucci, Nicholas T. – Educational and Psychological Measurement, 1986
This study evaluated the degree to which 102 undergraduate participants objected to questions on the Minnesota Multiphasic Personality Inventory (MMPI) which referred to sex, religion, bladder and bowel functions, family relationships, and unusual thinking, in comparison to their degree of objection to the length of the MMPI and repetition of questions.…
Descriptors: College Students, Higher Education, Personality Measures, Psychological Evaluation
Sindhu, R. S.; Sharma, Reeta – Science Education International, 1999
Finds that the time required to attempt all the test items of each question paper in a four-paper sample was inversely proportional to the percentage of students who attempted all the test items of that paper. Extrapolates results to give guidelines for determining the feasibility of newly-developed exam papers. (WRM)
Descriptors: Science Tests, Secondary Education, Test Construction, Test Length
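The inverse-proportionality finding above suggests a simple feasibility check. The sketch below assumes time × completion percentage ≈ constant, with a calibration constant taken from one hypothetical reference paper; the numbers are made up for illustration:

```python
def predicted_completion_rate(minutes_required: float, k: float) -> float:
    """Expected percentage of students attempting all items, assuming the
    inverse relation time * percentage = k. The constant k is a
    hypothetical calibration from a paper with known timing data."""
    return k / minutes_required

# Calibrate on a 120-minute paper with 80% completion, then predict for
# a newly developed paper that needs 150 minutes:
k = 120 * 80.0
print(predicted_completion_rate(150, k))  # → 64.0
```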