Showing 1 to 15 of 21 results
Peer reviewed
Uysal, Ibrahim; Sahin-Kürsad, Merve; Kiliç, Abdullah Faruk – Participatory Educational Research, 2022
The aim of the study was to examine whether the common items in mixed-format tests (e.g., multiple-choice and essay items) contain parameter drift in test equating performed with the common-item nonequivalent groups design. In this study, which was carried out using Monte Carlo simulation with a fully crossed design, the factors of test…
Descriptors: Test Items, Test Format, Item Response Theory, Equated Scores
Peer reviewed
Sinharay, Sandip – Applied Measurement in Education, 2017
Karabatsos compared the power of 36 person-fit statistics using receiver operating characteristic curves and found the "H^T" statistic to be the most powerful in identifying aberrant examinees. He found three statistics, "C", "MCI", and "U3", to be the next most powerful. These four statistics,…
Descriptors: Nonparametric Statistics, Goodness of Fit, Simulation, Comparative Analysis
Peer reviewed
Isbell, Dan; Winke, Paula – Language Testing, 2019
The American Council on the Teaching of Foreign Languages (ACTFL) oral proficiency interview -- computer (OPIc) testing system represents an ambitious effort in language assessment: Assessing oral proficiency in over a dozen languages, on the same scale, from virtually anywhere at any time. Especially for users in contexts where multiple foreign…
Descriptors: Oral Language, Language Tests, Language Proficiency, Second Language Learning
Peer reviewed
Watanabe, Yoshinori – Language Testing, 2013
This article describes the National Center Test for University Admissions, a unified national test in Japan, which is taken by 500,000 students every year. It states that implementation of the Center Test began in 1990, with the English component consisting only of the written section until 2005, when the listening section was first implemented…
Descriptors: College Admission, Foreign Countries, College Entrance Examinations, English (Second Language)
Peer reviewed
Camilli, Gregory – Educational Research and Evaluation, 2013
In the attempt to identify or prevent unfair tests, both quantitative analyses and logical evaluation are often used. For the most part, fairness evaluation is a pragmatic attempt at determining whether procedural or substantive due process has been accorded to either a group of test takers or an individual. In both the individual and comparative…
Descriptors: Alternative Assessment, Test Bias, Test Content, Test Format
Peer reviewed
Cui, Ying; Leighton, Jacqueline P. – Journal of Educational Measurement, 2009
In this article, we introduce a person-fit statistic called the hierarchy consistency index (HCI) to help detect misfitting item response vectors for tests developed and analyzed based on a cognitive model. The HCI ranges from -1.0 to 1.0, with values close to -1.0 indicating that students respond unexpectedly or differently from the responses…
Descriptors: Test Length, Simulation, Correlation, Research Methodology
Peer reviewed
Wollack, James A. – Applied Measurement in Education, 2006
Many of the currently available statistical indexes to detect answer copying lack sufficient power at small α levels or when the amount of copying is relatively small. Furthermore, there is no one index that is uniformly best. Depending on the type or amount of copying, certain indexes are better than others. The purpose of this article was…
Descriptors: Statistical Analysis, Item Analysis, Test Length, Sample Size
Peer reviewed
Conger, Anthony J. – Educational and Psychological Measurement, 1983
A paradoxical phenomenon of decreases in reliability as the number of elements averaged over increases is shown to be possible in multifacet reliability procedures (intraclass correlations or generalizability coefficients). Conditions governing this phenomenon are presented along with implications and cautions. (Author)
Descriptors: Generalizability Theory, Test Construction, Test Items, Test Length
Peer reviewed
Gallucci, Nicholas T. – Educational and Psychological Measurement, 1986
This study evaluated the degree to which 102 undergraduate participants objected to questions on the Minnesota Multiphasic Personality Inventory (MMPI) referring to sex, religion, bladder and bowel functions, family relationships, and unusual thinking, in comparison with their degree of objection to the length of the MMPI and the repetition of questions.…
Descriptors: College Students, Higher Education, Personality Measures, Psychological Evaluation
Peer reviewed
Sindhu, R. S.; Sharma, Reeta – Science Education International, 1999
Finds that the time required to attempt all the test items of each question paper in a four-paper sample was inversely proportional to the percentage of students who attempted all the test items of that paper. Extrapolates results to give guidelines for determining the feasibility of newly-developed exam papers. (WRM)
Descriptors: Science Tests, Secondary Education, Test Construction, Test Length
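The inverse-proportionality finding above lends itself to a simple feasibility check for a newly developed paper. The sketch below is a hypothetical illustration (all numbers are invented, not taken from the article): it estimates the constant of proportionality from calibration papers, then predicts the completion rate for a proposed time allotment.

```python
# Hypothetical illustration of the inverse-proportionality finding:
# time_required * percent_attempting_all ≈ constant across papers.
# All numbers below are invented for illustration.

calibration = [
    (150, 90.0),   # (minutes required, % of students attempting all items)
    (180, 75.0),
    (200, 67.5),
]

# Estimate the proportionality constant k from the calibration papers.
k = sum(t * p for t, p in calibration) / len(calibration)

def expected_completion_rate(minutes_allotted: float) -> float:
    """Predict the % of students expected to attempt all items."""
    return k / minutes_allotted

# A new 160-minute paper is deemed feasible if, say, >= 80% would finish.
rate = expected_completion_rate(160)
feasible = rate >= 80.0
```

With the invented calibration data, k works out to 13,500 minute-percent, so a 160-minute paper predicts roughly an 84% completion rate.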
Peer reviewed
Woodard, John L.; Axelrod, Bradley N. – Psychological Assessment, 1995
Using 308 patients referred for neuropsychological evaluation, 2 regression equations were developed to predict weighted raw score sums for General Memory and Delayed Recall using the Wechsler Memory Scale-Revised (WMS-R) analogs of 5 subtests from the original WMS. The equations may help reduce WMS-R administration time. (SLD)
Descriptors: Equations (Mathematics), Memory, Neuropsychology, Patients
Peer reviewed
Budescu, David – Journal of Educational Measurement, 1985
An important determinant of equating process efficiency is the correlation between the anchor test and components of each form. Use of some monotonic function of this correlation as a measure of equating efficiency is suggested. A model relating anchor test length and test reliability to this measure of efficiency is presented. (Author/DWH)
Descriptors: Correlation, Equated Scores, Mathematical Models, Standardized Tests
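Budescu's model ties anchor-test length and reliability to equating efficiency through the anchor-form correlation. A minimal classical-test-theory sketch, assuming only the standard Spearman-Brown formula and the attenuation ceiling on the correlation between two measures of the same trait (the illustrative reliability values are invented, not from the article):

```python
def spearman_brown(r1: float, m: float) -> float:
    """Reliability of a test lengthened by factor m (Spearman-Brown)."""
    return m * r1 / (1.0 + (m - 1.0) * r1)

def max_anchor_form_correlation(rel_anchor: float, rel_form: float) -> float:
    """Classical-test-theory ceiling on the observed anchor-form
    correlation: sqrt of the product of the two reliabilities."""
    return (rel_anchor * rel_form) ** 0.5

# Invented numbers: a single anchor item has reliability 0.20;
# the full form has reliability 0.90.
rel_20_item_anchor = spearman_brown(0.20, 20)   # reliability of a 20-item anchor
ceiling = max_anchor_form_correlation(rel_20_item_anchor, 0.90)
```

Lengthening the anchor raises its reliability and hence the attainable anchor-form correlation, which is the intuition behind using a monotonic function of that correlation as an efficiency measure.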
Peer reviewed
Wainer, Howard; Kiely, Gerard L. – Journal of Educational Measurement, 1987
The testlet, a bundle of test items, alleviates some problems associated with computerized adaptive testing: context effects, lack of robustness, and item difficulty ordering. While testlets may be linear or hierarchical, the most useful ones are four-level hierarchical units, containing 15 items and partitioning examinees into 16 classes. (GDC)
Descriptors: Adaptive Testing, Computer Assisted Testing, Context Effect, Item Banks
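The 15-item/16-class figures follow from the arithmetic of a complete binary tree: a four-level hierarchical testlet with two-branch (correct/incorrect) routing has 2^4 − 1 = 15 item nodes and 2^4 = 16 terminal classes. A minimal sketch, assuming binary routing; the array layout and function name are illustrative, not from the article:

```python
# A 4-level hierarchical testlet as a complete binary tree stored in an
# array: node i has children 2i+1 (answered incorrectly) and 2i+2
# (answered correctly).

DEPTH = 4
N_ITEMS = 2 ** DEPTH - 1     # 15 items in the testlet
N_CLASSES = 2 ** DEPTH       # 16 examinee classes

def route(responses) -> int:
    """Route an examinee through the tree. responses[i] is 1 (correct)
    or 0 (incorrect) for the item at array index i; each examinee
    actually answers only the DEPTH items on their path. Returns the
    class (leaf index 0..15) reached after DEPTH items."""
    node = 0
    for _ in range(DEPTH):
        node = 2 * node + 1 + responses[node]   # left on 0, right on 1
    return node - N_ITEMS                        # leaves follow the 15 nodes
```

An examinee who misses every routed item lands in class 0; one who answers every routed item correctly lands in class 15.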
Peer reviewed
Cudeck, Robert; And Others – Applied Psychological Measurement, 1979
TAILOR, a computer program which implements an approach to tailored testing, was examined by Monte Carlo methods. The evaluation showed the procedure to be highly reliable and capable of reducing the required number of tests items by about one half. (Author/JKS)
Descriptors: Adaptive Testing, Computer Programs, Feasibility Studies, Item Analysis
Peer reviewed
Wilcox, Rand R. – Educational and Psychological Measurement, 1979
A problem of considerable importance in certain educational settings is determining how many items to include on a mastery test. Applying ranking and selection procedures, a solution is given which includes as a special case all existing single-stage, non-Bayesian solutions based on a strong true-score model. (Author/JKS)
Descriptors: Criterion Referenced Tests, Mastery Tests, Nonparametric Statistics, Probability