Özyurt, Hacer; Özyurt, Özcan – Eurasian Journal of Educational Research, 2015
Problem Statement: Learning-teaching activities create a need to determine whether they achieve their goals, so multiple-choice tests that pose the same set of questions to every examinee are frequently used. However, this traditional form of assessment and evaluation contrasts with modern education, where individual learning characteristics are…
Descriptors: Probability, Adaptive Testing, Computer Assisted Testing, Item Response Theory
Lorié, William A. – Online Submission, 2013
A reverse engineering approach to automatic item generation (AIG) was applied to a figure-based publicly released test item from the Organisation for Economic Cooperation and Development (OECD) Programme for International Student Assessment (PISA) mathematical literacy cognitive instrument as part of a proof of concept. The author created an item…
Descriptors: Numeracy, Mathematical Concepts, Mathematical Logic, Difficulty Level
Further Results on the Standard Errors of Estimate Associated with Item-Examinee Sampling Procedures
Shoemaker, David M. – Journal of Educational Measurement, 1971
Descriptors: Difficulty Level, Item Sampling, Statistical Analysis, Test Construction
Burton, Richard F. – Assessment & Evaluation in Higher Education, 2006
Many academic tests (e.g. short-answer and multiple-choice) sample required knowledge with questions scoring 0 or 1 (dichotomous scoring). Few textbooks give useful guidance on the length of test needed to do this reliably. Posey's binomial error model of 1932 provides the best starting point, but allows neither for heterogeneity of question…
Descriptors: Item Sampling, Tests, Test Length, Test Reliability
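The binomial error model mentioned in the Burton entry can be sketched in a few lines: under that model, an examinee with true proportion-correct p answering n dichotomously scored items has an observed score distributed Binomial(n, p), so the standard error of the observed proportion shrinks as 1/√n. The value p = 0.7 and the test lengths below are illustrative choices, not figures from the study.

```python
import math

def binomial_sem(p: float, n: int) -> float:
    """Standard error of the observed proportion-correct under the
    binomial error model, for true proportion p on an n-item test."""
    return math.sqrt(p * (1.0 - p) / n)

# Quadrupling the test length halves the standard error of measurement.
for n in (10, 40, 160):
    print(n, round(binomial_sem(0.7, n), 3))
```

This is the core of the test-length argument: reliability gains come only at the 1/√n rate, which is why short dichotomously scored tests sample knowledge so noisily.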
Berger, Martijn P. F. – 1989
The problem of obtaining designs that result in the most precise parameter estimates is encountered in at least two situations where item response theory (IRT) models are used. In so-called two-stage testing procedures, certain designs that match difficulty levels of the test items with the ability of the examinees may be located. Such designs…
Descriptors: Difficulty Level, Efficiency, Equations (Mathematics), Heuristics

van der Linden, Wim J. – Applied Psychological Measurement, 1979
The restrictions on item difficulties that must be met when binomial models are applied to domain-referenced testing are examined. Both a deterministic and a stochastic conception of item responses are discussed with respect to difficulty and Guttman-type items. (Author/BH)
Descriptors: Difficulty Level, Item Sampling, Latent Trait Theory, Mathematical Models
Revuelta, Javier – Psychometrika, 2004
Two psychometric models are presented for evaluating the difficulty of the distractors in multiple-choice items. They are based on the criterion of rising distractor selection ratios, which facilitates interpretation of the subject and item parameters. Statistical inferential tools are developed in a Bayesian framework: modal a posteriori…
Descriptors: Multiple Choice Tests, Psychometrics, Models, Difficulty Level
Berk, Ronald A. – 1978
Sixteen item statistics recommended for use in the development of criterion-referenced tests were evaluated. There were two major criteria: (1) practicability in terms of ease of computation and interpretation and (2) meaningfulness in the context of the development process. Most of the statistics were based on a comparison of performance changes…
Descriptors: Achievement Tests, Criterion Referenced Tests, Difficulty Level, Guides
Klein-Braley, Christine – 1984
This report investigates the selection of appropriate texts for C-Tests, a modified form of the cloze test, for assessing second language learning. The procedure first involved administering different texts to sample groups to determine the C-test difficulty of each text. At the same time, a variety of…
Descriptors: Cloze Procedure, Difficulty Level, English (Second Language), Foreign Countries
Scheetz, James P.; Forsyth, Robert A. – 1977
Empirical evidence is presented related to the effects of using a stratified sampling of items in multiple matrix sampling on the accuracy of estimates of the population mean. Data were obtained from a sample of 600 high school students for a 36-item mathematics test and a 40-item vocabulary test, both subtests of the Iowa Tests of Educational…
Descriptors: Achievement Tests, Difficulty Level, Item Analysis, Item Sampling
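The multiple matrix sampling design in the Scheetz and Forsyth entry can be illustrated with a toy simulation: each examinee answers only a random subset of the item pool, and each examinee's subtest score is projected to the full-test scale before averaging. The item difficulties, sample size, and form length below are invented for illustration and are not taken from the study.

```python
import random

random.seed(0)

# Hypothetical per-item probabilities of a correct response for a 36-item test.
difficulties = [0.3 + 0.4 * (i % 10) / 10 for i in range(36)]
true_mean = sum(difficulties)  # expected full-test mean score in the population

def matrix_sample_estimate(n_examinees: int, items_per_form: int) -> float:
    """Estimate the population mean full-test score when each examinee
    sees only a random sample of items (matrix sampling)."""
    total = 0.0
    for _ in range(n_examinees):
        form = random.sample(range(36), items_per_form)
        score = sum(1 for i in form if random.random() < difficulties[i])
        total += score * 36 / items_per_form  # project to full-test scale
    return total / n_examinees

est = matrix_sample_estimate(600, 9)
print(round(est, 2))
```

With 600 simulated examinees each seeing 9 of 36 items, the projected mean lands close to the full-test expectation, which is the property the study examines under stratified versus simple item sampling.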
Theunissen, Phiel J. J. M. – 1983
Any systematic approach to the assessment of students' ability implies the use of a model. The more explicit the model is, the more its users know about what they are doing and what the consequences are. The Rasch model is a strong model where measurement is a bonus of the model itself. It is based on four ideas: (1) separation of observable…
Descriptors: Ability Grouping, Difficulty Level, Evaluation Criteria, Item Sampling
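The Rasch model underlying the Theunissen entry gives the probability of a correct response as a logistic function of the difference between person ability theta and item difficulty b; the separation of person and item parameters the abstract alludes to is visible in the fact that the probability depends only on that difference. A minimal sketch (theta and b are the standard Rasch parameters, not values from the report):

```python
import math

def rasch_p(theta: float, b: float) -> float:
    """Rasch model: P(correct) = exp(theta - b) / (1 + exp(theta - b))."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

print(rasch_p(0.0, 0.0))  # ability equal to difficulty gives exactly 0.5
print(rasch_p(1.0, 0.0) > rasch_p(0.0, 0.0))  # higher ability, higher probability
```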
Forster, Fred – 1987
Studies carried out over a 12-year period addressed fundamental questions on the use of Rasch-based item banks. Large field tests administered in grades 3-8 of reading, mathematics, and science items, as well as standardized test results were used to explore the possible effects of many factors on item calibrations. In general, the results…
Descriptors: Achievement Tests, Difficulty Level, Elementary Education, Item Analysis