ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	5

Descriptor

Test Length	14
Testing Problems	14
Computer Assisted Testing	4
Mathematical Models	4
Test Construction	4
Test Format	4
Test Items	4
Adaptive Testing	3
Educational Testing	3
Item Analysis	3
Measurement Techniques	3
Prediction	3
Test Validity	3
Comparative Analysis	2
Cutting Scores	2
Equated Scores	2
Equations (Mathematics)	2
Error Patterns	2
Evaluation Methods	2
Evaluation Problems	2
Evaluation Research	2
Guidelines	2
Item Banks	2
Language Tests	2
Mastery Tests	2
More ▼

Source

Journal of Educational…	2
Language Testing	2
Applied Measurement in…	1
Applied Psychological…	1
Educational Research and…	1
Evaluation in Education:…	1
Popular Measurement	1

Publication Type

Reports - Evaluative	14
Journal Articles	9
Speeches/Meeting Papers	3
Collected Works - General	1

Education Level

Elementary Secondary Education	1
Higher Education	1
Postsecondary Education	1

Audience

Location

Japan

Laws, Policies, & Programs

Assessments and Surveys

ACTFL Oral Proficiency…

What Works Clearinghouse Rating

Showing all 14 results Save | Export

ACTFL Oral Proficiency Interview -- Computer (OPIc)

Peer reviewed

Direct link

Isbell, Dan; Winke, Paula – Language Testing, 2019

The American Council on the Teaching of Foreign Languages (ACTFL) oral proficiency interview -- computer (OPIc) testing system represents an ambitious effort in language assessment: Assessing oral proficiency in over a dozen languages, on the same scale, from virtually anywhere at any time. Especially for users in contexts where multiple foreign…

Descriptors: Oral Language, Language Tests, Language Proficiency, Second Language Learning

The National Center Test for University Admissions

Peer reviewed

Direct link

Watanabe, Yoshinori – Language Testing, 2013

This article describes the National Center Test for University Admissions, a unified national test in Japan, which is taken by 500,000 students every year. It states that implementation of the Center Test began in 1990, with the English component consisting only of the written section until 2005, when the listening section was first implemented…

Descriptors: College Admission, Foreign Countries, College Entrance Examinations, English (Second Language)

Ongoing Issues in Test Fairness

Peer reviewed

Direct link

Camilli, Gregory – Educational Research and Evaluation, 2013

In the attempt to identify or prevent unfair tests, both quantitative analyses and logical evaluation are often used. For the most part, fairness evaluation is a pragmatic attempt at determining whether procedural or substantive due process has been accorded to either a group of test takers or an individual. In both the individual and comparative…

Descriptors: Alternative Assessment, Test Bias, Test Content, Test Format

The Hierarchy Consistency Index: Evaluating Person Fit for Cognitive Diagnostic Assessment

Peer reviewed

Direct link

Cui, Ying; Leighton, Jacqueline P. – Journal of Educational Measurement, 2009

In this article, we introduce a person-fit statistic called the hierarchy consistency index (HCI) to help detect misfitting item response vectors for tests developed and analyzed based on a cognitive model. The HCI ranges from -1.0 to 1.0, with values close to -1.0 indicating that students respond unexpectedly or differently from the responses…

Descriptors: Test Length, Simulation, Correlation, Research Methodology

Simultaneous Use of Multiple Answer Copying Indexes to Improve Detection Rates

Peer reviewed

Direct link

Wollack, James A. – Applied Measurement in Education, 2006

Many of the currently available statistical indexes to detect answer copying lack sufficient power at small [alpha] levels or when the amount of copying is relatively small. Furthermore, there is no one index that is uniformly best. Depending on the type or amount of copying, certain indexes are better than others. The purpose of this article was…

Descriptors: Statistical Analysis, Item Analysis, Test Length, Sample Size

Congeneric Models and Levine's Linear Equating Procedures.

Download full text

Brennan, Robert L. – 1990

In 1955, R. Levine introduced two linear equating procedures for the common-item non-equivalent populations design. His procedures make the same assumptions about true scores; they differ in terms of the nature of the equating function used. In this paper, two parameterizations of a classical congeneric model are introduced to model the variables…

Descriptors: Equated Scores, Equations (Mathematics), Mathematical Models, Research Design

Monte Carlo Evaluation of Implied Orders as a Basis for Tailored Testing.

Peer reviewed

Cudeck, Robert; And Others – Applied Psychological Measurement, 1979

TAILOR, a computer program which implements an approach to tailored testing, was examined by Monte Carlo methods. The evaluation showed the procedure to be highly reliable and capable of reducing the required number of tests items by about one half. (Author/JKS)

Descriptors: Adaptive Testing, Computer Programs, Feasibility Studies, Item Analysis

Stepping Up Test Score Conditional Variances.

Peer reviewed

Woodruff, David – Journal of Educational Measurement, 1991

Improvements are made on previous estimates for the conditional standard error of measurement in prediction, the conditional standard error of estimation (CSEE), and the conditional standard error of prediction (CSEP). Better estimates of how test length affects CSEE and CSEP are derived. (SLD)

Descriptors: Equations (Mathematics), Error of Measurement, Estimation (Mathematics), Mathematical Models

Three Practical Issues for Modern Adaptive Testing Item Pools.

Download full text

Stocking, Martha L. – 1994

As adaptive testing moves toward operational implementation in large scale testing programs, where it is important that adaptive tests be as parallel as possible to existing linear tests, a number of practical issues arise. This paper concerns three such issues. First, optimum item pool size is difficult to determine in advance of pool…

Descriptors: Adaptive Testing, Computer Assisted Testing, Item Banks, Standards

Passing Score and Length of a Mastery Test: An Old Problem Appraoched Anew. Twente Educational Report Number 11.

Download full text

van der Linden, Wim J. – 1980

A classical problem in mastery testing is the choice of passing score and test length so that the mastery decisions are optimal. This problem has been addressed several times from a variety of viewpoints. In this paper, the usual indifference zone approach is adopted, with a new criterion for optimizing the passing score. Specifically,…

Descriptors: Classification, Cutting Scores, Error Patterns, Guessing (Tests)

Passing Score and Length of a Mastery Test.

van der Linden, Wim J. – Evaluation in Education: International Progress, 1982

In mastery testing a linear relationship between an optimal passing score and test length is presented with a new optimization criterion. The usual indifference zone approach, a binomial error model, decision errors, and corrections for guessing are discussed. Related results in sequential testing and the latent class approach are included. (CM)

Descriptors: Cutting Scores, Educational Testing, Mastery Tests, Mathematical Models

Testing Testing Testing.

Peer reviewed

Deville, Craig; O'Neill, Thomas; Wright, Benjamin D.; Woodcock, Richard W.; Munoz-Sandoval, Ana; Gershon, Richard C.; Bergstrom, Betty – Popular Measurement, 1998

Articles in this special section consider (1) flow in test taking (Craig Deville); (2) testwiseness (Thomas O'Neill); (3) test length (Benjamin Wright); (4) cross-language test equating (Richard W. Woodcock and Ana Munoz-Sandoval); (5) computer-assisted testing and testwiseness (Richard Gershon and Betty Bergstrom); and (6) Web-enhanced testing…

Descriptors: Computer Assisted Testing, Educational Testing, Equated Scores, Measurement Techniques

Pretesting alongside an Operational CAT.

Download full text

Davey, Tim; Pommerich, Mary; Thompson, Tony D. – 1999

In computerized adaptive testing (CAT), new or experimental items are frequently administered alongside operational tests to gather the pretest data needed to replenish and replace item pools. The two basic strategies used to combine pretest and operational items are embedding and appending. Variable-length CATs are preferred because of the…

Descriptors: Adaptive Testing, Computer Assisted Testing, Item Banks, Measurement Techniques

Parallel Short Forms of the Crandall Social Desirability Test for Children: Shortening Instruments for Research Purposes.

Download full text

Carifio, James – 1992

Researchers and program evaluators would often like to use a particular instrument, but do not because it is too long or would require too much testing time. Having a validated set of objective procedures for reducing the size of an instrument could improve many research and evaluation efforts. This paper reports the results of test reduction or…

Descriptors: Attitude Measures, Elementary School Students, Factor Analysis, Intermediate Grades

van der Linden, Wim J.	2
Bergstrom, Betty	1
Brennan, Robert L.	1
Camilli, Gregory	1
Carifio, James	1
Cudeck, Robert	1
Cui, Ying	1
Davey, Tim	1
Deville, Craig	1
Gershon, Richard C.	1
Isbell, Dan	1
Leighton, Jacqueline P.	1
Munoz-Sandoval, Ana	1
O'Neill, Thomas	1
Pommerich, Mary	1
Stocking, Martha L.	1
Thompson, Tony D.	1
Watanabe, Yoshinori	1
Winke, Paula	1
Wollack, James A.	1
Woodcock, Richard W.	1
Woodruff, David	1
Wright, Benjamin D.	1
More ▼