Showing all 12 results
Peer reviewed
Marc Brysbaert – Cognitive Research: Principles and Implications, 2024
Experimental psychology is witnessing an increase in research on individual differences, which requires the development of new tasks that can reliably assess variations among participants. To do this, cognitive researchers need statistical methods that many researchers have not learned during their training. The lack of expertise can pose…
Descriptors: Experimental Psychology, Individual Differences, Statistical Analysis, Task Analysis
Peer reviewed
Bashkov, Bozhidar M.; Clauser, Jerome C. – Practical Assessment, Research & Evaluation, 2019
Successful testing programs rely on high-quality test items to produce reliable scores and defensible exams. However, determining what statistical screening criteria are most appropriate to support these goals can be daunting. This study describes and demonstrates cost-benefit analysis as an empirical approach to determining appropriate screening…
Descriptors: Test Items, Test Reliability, Evaluation Criteria, Accuracy
Lorié, William A. – Online Submission, 2013
A reverse engineering approach to automatic item generation (AIG) was applied to a figure-based publicly released test item from the Organisation for Economic Cooperation and Development (OECD) Programme for International Student Assessment (PISA) mathematical literacy cognitive instrument as part of a proof of concept. The author created an item…
Descriptors: Numeracy, Mathematical Concepts, Mathematical Logic, Difficulty Level
Peer reviewed
Waller, Niels G. – Applied Psychological Measurement, 2008
Reliability is a property of test scores from individuals who have been sampled from a well-defined population. Reliability indices, such as coefficient alpha and related formulas for internal consistency reliability (KR-20, Hoyt's reliability), yield lower bound reliability estimates when (a) subjects have been sampled from a single population and when…
Descriptors: Test Items, Reliability, Scores, Psychometrics
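The internal-consistency indices this abstract names can be illustrated with a small sketch. The formula below is standard Cronbach's alpha, which reduces to KR-20 for 0/1-scored items; the response data are made up for the demonstration, not taken from the article.

```python
# Illustrative computation of an internal-consistency index named in the
# abstract above: coefficient alpha (equal to KR-20 for dichotomous items).

def coefficient_alpha(scores):
    """Cronbach's alpha for a list of per-person item-score lists."""
    k = len(scores[0])                      # number of items
    def var(xs):                            # population variance
        m = sum(xs) / len(xs)
        return sum((x - m) ** 2 for x in xs) / len(xs)
    item_vars = [var([row[j] for row in scores]) for j in range(k)]
    total_var = var([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)

# Hypothetical 0/1 responses: 5 examinees x 4 items.
data = [
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [1, 1, 1, 1],
    [0, 0, 0, 0],
]
print(round(coefficient_alpha(data), 3))  # -> 0.8
```

As the abstract notes, such coefficients are lower-bound estimates under single-population sampling, so a value like 0.8 here understates rather than overstates reliability under those conditions.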
Peer reviewed
Shoemaker, David M. – Journal of Educational Measurement, 1971
Descriptors: Difficulty Level, Item Sampling, Statistical Analysis, Test Construction
Peer reviewed
van der Linden, Wim J. – Applied Psychological Measurement, 1979
The restrictions on item difficulties that must be met when binomial models are applied to domain-referenced testing are examined. Both a deterministic and a stochastic conception of item responses are discussed with respect to difficulty and Guttman-type items. (Author/BH)
Descriptors: Difficulty Level, Item Sampling, Latent Trait Theory, Mathematical Models
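The binomial model discussed in this entry treats the number-correct score on an n-item sample from a domain as Binomial(n, p), where p is the examinee's true domain proportion. A minimal sketch (the 10-item test, 8-correct cutoff, and p = 0.7 are illustrative values, not from the article):

```python
from math import comb

# Binomial test model: number-correct score X on an n-item domain sample
# follows Binomial(n, p) for an examinee with true domain proportion p.

def score_prob(x, n, p):
    """P(X = x) for number-correct score x on an n-item test."""
    return comb(n, x) * p**x * (1 - p)**(n - x)

# Probability a borderline examinee (p = 0.7) passes a >= 8-of-10 cutoff:
p_pass = sum(score_prob(x, 10, 0.7) for x in range(8, 11))
print(round(p_pass, 3))  # -> 0.383
```

Under the deterministic (Guttman-type) conception the abstract contrasts with this stochastic one, the same examinee would pass or fail with certainty depending on which items were sampled.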
Peer reviewed
Passmore, David Lynn – Journal of Studies in Technical Careers, 1983
Vocational and technical education researchers need to be aware of the uses and limits of various statistical models. The author reviews the Rasch Model and applies it to results from a nutrition test given to student nurses. (Author)
Descriptors: Educational Research, Item Sampling, Nursing Education, Nutrition
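The Rasch model this entry reviews gives the probability of a correct response as a logistic function of person ability minus item difficulty. A minimal sketch (parameter values are illustrative, not estimates from the nutrition test):

```python
import math

# Rasch model: P(correct) depends only on the difference between person
# ability (theta) and item difficulty (b).

def rasch_p(theta, b):
    """P(correct | ability theta, difficulty b) under the Rasch model."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

print(round(rasch_p(0.0, 0.0), 3))   # ability matches difficulty -> 0.5
print(round(rasch_p(1.0, -1.0), 3))  # able person, easy item -> near 1
```

Because only the difference theta - b enters the formula, persons and items are placed on a single scale, which is what makes the model attractive for the comparisons Passmore discusses.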
Berk, Ronald A. – 1978
Sixteen item statistics recommended for use in the development of criterion-referenced tests were evaluated. There were two major criteria: (1) practicability in terms of ease of computation and interpretation and (2) meaningfulness in the context of the development process. Most of the statistics were based on a comparison of performance changes…
Descriptors: Achievement Tests, Criterion Referenced Tests, Difficulty Level, Guides
Scheetz, James P.; Forsyth, Robert A. – 1977
Empirical evidence is presented related to the effects of using a stratified sampling of items in multiple matrix sampling on the accuracy of estimates of the population mean. Data were obtained from a sample of 600 high school students for a 36-item mathematics test and a 40-item vocabulary test, both subtests of the Iowa Tests of Educational…
Descriptors: Achievement Tests, Difficulty Level, Item Analysis, Item Sampling
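The matrix-sampling design studied here can be sketched in simulation: each examinee answers only a subset of items, and the full-test population mean is estimated by scaling up the observed proportion correct. This is a simplified sketch using simple random item sampling (the article examines stratified sampling), with all data simulated rather than taken from the Iowa Tests.

```python
import random

# Multiple matrix sampling: estimate the population mean total-test score
# when each examinee answers only a subset of the K items. Simulated data.

random.seed(1)
K = 36                                                  # items in full test
p_true = [random.uniform(0.3, 0.9) for _ in range(K)]   # true item p-values

def sampled_mean_estimate(n_examinees=600, items_per_form=12):
    total_correct = total_answered = 0
    for _ in range(n_examinees):
        form = random.sample(range(K), items_per_form)  # random item subset
        for j in form:
            total_correct += random.random() < p_true[j]
            total_answered += 1
    # Scale observed proportion correct up to the full K-item test.
    return K * total_correct / total_answered

est = sampled_mean_estimate()
true_mean = sum(p_true)          # expected total score on all K items
print(round(est, 2), round(true_mean, 2))
```

With 600 examinees the estimate lands close to the true mean; the question the article addresses is whether stratifying the item sampling (e.g., by difficulty) tightens that accuracy further.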
Douglass, James B. – 1979
A general process for testing the feasibility of applying alternative mathematical or statistical models to the solution of a practical problem is presented and flowcharted. The system is used to compare five models for test equating: (1) anchor test equating using classical test theory; (2) anchor test equating using the one-parameter logistic…
Descriptors: Comparative Analysis, Equated Scores, Flow Charts, Goodness of Fit
Theunissen, Phiel J. J. M. – 1983
Any systematic approach to the assessment of students' ability implies the use of a model. The more explicit the model is, the more its users know about what they are doing and what the consequences are. The Rasch model is a strong model where measurement is a bonus of the model itself. It is based on four ideas: (1) separation of observable…
Descriptors: Ability Grouping, Difficulty Level, Evaluation Criteria, Item Sampling
Lewy, Arieh; Doron, Rina – 1977
The concept of tailored testing for individuals is applied to the construction of tests for special groups and extended to apply to item content as well as item difficulty. It is suggested that evaluators may decide to construct tests on the basis of a unique combination of items drawn from an item bank to fit the need of a particular group. At…
Descriptors: Achievement Tests, Adaptive Testing, Criterion Referenced Tests, Group Norms