ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	1
Since 2017 (last 10 years)	1
Since 2007 (last 20 years)	3

Descriptor

Difficulty Level	17
Item Sampling	17
Test Items	12
Statistical Analysis	7
Item Analysis	5
Test Construction	5
Test Reliability	5
Test Validity	5
Mathematical Models	4
Achievement Tests	3
Latent Trait Theory	3
Models	3
Probability	3
Test Theory	3
Adaptive Testing	2
Computer Assisted Testing	2
Criterion Referenced Tests	2
Foreign Countries	2
Item Banks	2
Item Response Theory	2
Language Tests	2
Mathematical Concepts	2
Matrices	2
Sampling	2
Second Languages	2
More ▼

Source

Applied Psychological…	2
Assessment & Evaluation in…	1
Educational Psychology Review	1
Educational and Psychological…	1
Eurasian Journal of…	1
Illinois School Research	1
Journal of Educational…	1
Online Submission	1
Psychometrika	1

Publication Type

Reports - Research	8
Journal Articles	5
Speeches/Meeting Papers	4
Reports - Evaluative	2
Reports - Descriptive	1
Reports - General	1

Education Level

Grade 11	1
Grade 12	1
High Schools	1
Secondary Education	1

Audience

Researchers

Location

Turkey	1
West Germany	1

Laws, Policies, & Programs

Assessments and Surveys

Program for International…

What Works Clearinghouse Rating

Showing 1 to 15 of 17 results Save | Export

Improving the Use of Retrieval Practice for Both Easy and Difficult Materials: The Effect of an Instructional Intervention

Peer reviewed

Direct link

Tian Fan; Luotong Hui; Liang Luo; Anique B. H. de Bruin – Educational Psychology Review, 2024

Recent research has suggested that students prefer restudying over retrieval practice when learning difficult materials, despite the latter being a more effective learning strategy. The current study investigated whether an instructional intervention can improve the use of retrieval practice for both easy and difficult materials. In Experiment 1,…

Descriptors: Information Retrieval, Intervention, Difficulty Level, Learning Strategies

Ability Level Estimation of Students on Probability Unit via Computerized Adaptive Testing

Peer reviewed
PDF on ERIC

Download full text

Özyurt, Hacer; Özyurt, Özcan – Eurasian Journal of Educational Research, 2015

Problem Statement: Learning-teaching activities bring along the need to determine whether they achieve their goals. Thus, multiple choice tests addressing the same set of questions to all are frequently used. However, this traditional assessment and evaluation form contrasts with modern education, where individual learning characteristics are…

Descriptors: Probability, Adaptive Testing, Computer Assisted Testing, Item Response Theory

An Application of Reverse Engineering to Automatic Item Generation: A Proof of Concept Using Automatically Generated Figures

Download full text

Lorié, William A. – Online Submission, 2013

A reverse engineering approach to automatic item generation (AIG) was applied to a figure-based publicly released test item from the Organisation for Economic Cooperation and Development (OECD) Programme for International Student Assessment (PISA) mathematical literacy cognitive instrument as part of a proof of concept. The author created an item…

Descriptors: Numeracy, Mathematical Concepts, Mathematical Logic, Difficulty Level

Further Results on the Standard Errors of Estimate Associated with Item-Examinee Sampling Procedures

Peer reviewed

Shoemaker, David M. – Journal of Educational Measurement, 1971

Descriptors: Difficulty Level, Item Sampling, Statistical Analysis, Test Construction

Standard Errors of Estimate in Item-Examinee Sampling as a Function of Test Reliability, Variation in Item Difficulty Indices and Degree of Skewness in the Normative Distribution

Peer reviewed

Shoemaker, David M. – Educational and Psychological Measurement, 1972

Descriptors: Difficulty Level, Error of Measurement, Item Sampling, Simulation

Sampling Knowledge and Understanding: How Long Should a Test Be?

Peer reviewed

Direct link

Burton, Richard F. – Assessment & Evaluation in Higher Education, 2006

Many academic tests (e.g. short-answer and multiple-choice) sample required knowledge with questions scoring 0 or 1 (dichotomous scoring). Few textbooks give useful guidance on the length of test needed to do this reliably. Posey's binomial error model of 1932 provides the best starting point, but allows neither for heterogeneity of question…

Descriptors: Item Sampling, Tests, Test Length, Test Reliability

On the Efficiency of IRT Models When Applied to Different Sampling Designs. Project Psychometric Aspects of Item Banking No. 45.

Berger, Martijn P. F. – 1989

The problem of obtaining designs that result in the most precise parameter estimates is encountered in at least two situations where item response theory (IRT) models are used. In so-called two-stage testing procedures, certain designs that match difficulty levels of the test items with the ability of the examinees may be located. Such designs…

Descriptors: Difficulty Level, Efficiency, Equations (Mathematics), Heuristics

Binomial Test Models and Item Difficulty.

Peer reviewed

van der Linden, Wim J. – Applied Psychological Measurement, 1979

The restrictions on item difficulties that must be met when binomial models are applied to domain-referenced testing are examined. Both a deterministic and a stochastic conception of item responses are discussed with respect to difficulty and Guttman-type items. (Author/BH)

Descriptors: Difficulty Level, Item Sampling, Latent Trait Theory, Mathematical Models

Analysis of Distractor Difficulty in Multiple-Choice Items

Peer reviewed

Direct link

Revuelta, Javier – Psychometrika, 2004

Two psychometric models are presented for evaluating the difficulty of the distractors in multiple-choice items. They are based on the criterion of rising distractor selection ratios, which facilitates interpretation of the subject and item parameters. Statistical inferential tools are developed in a Bayesian framework: modal a posteriori…

Descriptors: Multiple Choice Tests, Psychometrics, Models, Difficulty Level

A Consumers' Guide to Criterion-Referenced Test Item Statistics.

Berk, Ronald A. – 1978

Sixteen item statistics recommended for use in the development of criterion-referenced tests were evaluated. There were two major criteria: (1) practicability in terms of ease of computation and interpretation and (2) meaningfulness in the context of the development process. Most of the statistics were based on a comparison of performance changes…

Descriptors: Achievement Tests, Criterion Referenced Tests, Difficulty Level, Guides

Aspects and Applications of Criterion-Referenced Tests

Kriewall, Thomas E. – Illinois School Research, 1972

Author discusses and defines criterion tests in the context of classroom needs that have created much of the interest in the theory at this time. The primary source of interest is related to the growing implementation of individualized curricula. (Author/CB)

Descriptors: Criterion Referenced Tests, Difficulty Level, Individualized Instruction, Item Analysis

Some Item Analysis and Test Theory for a System of Computer-Assisted Test Construction for Individualized Instruction

Peer reviewed

Lord, Frederic M. – Applied Psychological Measurement, 1977

Under given conditions, conventional testing and computer-generated repeatable testing (CGRT) are equally effective for estimating examinee ability; CGRT is more effective for estimating the mean ability level of a group and less effective for estimating ability differences among individuals. These conclusion are drawn from domain-referenced test…

Descriptors: Career Development, Computer Assisted Testing, Difficulty Level, Group Norms

Advance Prediction of Difficulty with C-Tests.

Klein-Braley, Christine – 1984

This report investigates the selection of appropriate texts for C-Tests, a modified form of the cloze test, for assessing second language learning. The procedure for textbook readability first involved the administration of different texts to sample groups to determine the C-test difficulty of individual texts. At the same time, a variety of…

Descriptors: Cloze Procedure, Difficulty Level, English (Second Language), Foreign Countries

A Comparison of Simple Random Sampling Versus Stratification for Allocating Items to Subtests in Multiple Matrix Sampling.

Download full text

Scheetz, James P.; Forsyth, Robert A. – 1977

Empirical evidence is presented related to the effects of using a stratified sampling of items in multiple matrix sampling on the accuracy of estimates of the population mean. Data were obtained from a sample of 600 high school students for a 36-item mathematics test and a 40-item vocabulary test, both subtests of the Iowa Tests of Educational…

Descriptors: Achievement Tests, Difficulty Level, Item Analysis, Item Sampling

A Tailored Testing Model Employing the Beta Distribution and Conditional Difficulties.

Download full text

Kalisch, Stanley J. – 1974

A tailored testing model employing the beta distribution, whose mean equals the difficulty of an item and whose variance is approximately equal to the sampling variance of the item difficulty, and employing conditional item difficulties, is proposed. The model provides a procedure by which a minimum number of items of a test, consisting of a set…

Descriptors: Adaptive Testing, Branching, Computer Oriented Programs, Decision Making

Previous Page | Next Page »

Pages: 1 | 2

Shoemaker, David M.	2
Anique B. H. de Bruin	1
Berger, Martijn P. F.	1
Berk, Ronald A.	1
Burton, Richard F.	1
Forster, Fred	1
Forsyth, Robert A.	1
Kalisch, Stanley J.	1
Klein-Braley, Christine	1
Kriewall, Thomas E.	1
Liang Luo	1
Lord, Frederic M.	1
Lorié, William A.	1
Luotong Hui	1
Revuelta, Javier	1
Scheetz, James P.	1
Theunissen, Phiel J. J. M.	1
Tian Fan	1
van der Linden, Wim J.	1
Özyurt, Hacer	1
Özyurt, Özcan	1
More ▼