ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	5
Since 2006 (last 20 years)	11

Descriptor

Item Sampling	45
Test Items	45
Test Construction	20
Difficulty Level	12
Statistical Analysis	12
Test Validity	12
Achievement Tests	11
Item Analysis	10
Test Reliability	10
Item Response Theory	9
Testing Problems	8
Criterion Referenced Tests	7
Foreign Countries	7
Item Banks	7
Mathematical Models	7
Language Tests	5
Matrices	5
Test Format	5
Test Interpretation	5
Test Length	5
Comparative Analysis	4
Equated Scores	4
Evaluation Criteria	4
Mastery Tests	4
Sampling	4
More ▼

Source

Journal of Educational…	4
Applied Psychological…	3
Online Submission	3
Educational and Psychological…	2
Psychometrika	2
Assessment & Evaluation in…	1
British Journal of…	1
Cognitive Research:…	1
College Student Journal	1
Eurasian Journal of…	1
International Journal of…	1
Journal of Educational and…	1
Journal of Studies in…	1
Physical Review Physics…	1
Practical Assessment,…	1
More ▼

Publication Type

Reports - Research	21
Journal Articles	19
Speeches/Meeting Papers	11
Reports - Evaluative	10
Reports - Descriptive	6
Reports - General	3
Guides - Non-Classroom	2
Books	1
Information Analyses	1
Tests/Questionnaires	1

Education Level

High Schools	2
Higher Education	2
Secondary Education	2
Elementary Education	1
Grade 11	1
Grade 12	1
Grade 4	1
Grade 8	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1
Postsecondary Education	1
More ▼

Audience

Researchers

Location

Bosnia and Herzegovina	1
China	1
Croatia	1
Kansas	1
Massachusetts	1
Netherlands	1
Nevada	1
Oklahoma	1
Oregon	1
Slovenia	1
Texas	1
Turkey	1
United Kingdom (England)	1
United Kingdom (Great Britain)	1
West Germany	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Computer Attitude Scale	1
Program for International…	1
Wechsler Intelligence Scale…	1
Wechsler Intelligence Scales…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 45 results Save | Export

Designing and Evaluating Tasks to Measure Individual Differences in Experimental Psychology: A Tutorial

Peer reviewed

Direct link

Marc Brysbaert – Cognitive Research: Principles and Implications, 2024

Experimental psychology is witnessing an increase in research on individual differences, which requires the development of new tasks that can reliably assess variations among participants. To do this, cognitive researchers need statistical methods that many researchers have not learned during their training. The lack of expertise can pose…

Descriptors: Experimental Psychology, Individual Differences, Statistical Analysis, Task Analysis

Application of the Professional Maturity Scale as a Computerized Adaptive Testing

Peer reviewed
PDF on ERIC

Download full text

Süleyman Demir; Derya Çobanoglu Aktan; Nese Güler – International Journal of Assessment Tools in Education, 2023

This study has two main purposes. Firstly, to compare the different item selection methods and stopping rules used in Computerized Adaptive Testing (CAT) applications with simulative data generated based on the item parameters of the Vocational Maturity Scale. Secondly, to test the validity of CAT application scores. For the first purpose,…

Descriptors: Computer Assisted Testing, Adaptive Testing, Vocational Maturity, Measures (Individuals)

Maintaining Item Banks with the Rasch Model: An Example from Wave Optics

Peer reviewed

Direct link

Glamocic, Džana Salibašic; Mešic, Vanes; Neumann, Knut; Sušac, Ana; Boone, William J.; Aviani, Ivica; Hasovic, Elvedin; Erceg, Nataša; Repnik, Robert; Grubelnik, Vladimir – Physical Review Physics Education Research, 2021

Item banks are generally considered the basis of a new generation of educational measurement. In combination with specialized software, they can facilitate the computerized assembling of multiple pre-equated test forms. However, for advantages of item banks to become fully realized it is important that the item banks store a relatively large…

Descriptors: Item Banks, Test Items, Item Response Theory, Item Sampling

Determining Item Screening Criteria Using Cost-Benefit Analysis

Peer reviewed
PDF on ERIC

Download full text

Bashkov, Bozhidar M.; Clauser, Jerome C. – Practical Assessment, Research & Evaluation, 2019

Successful testing programs rely on high-quality test items to produce reliable scores and defensible exams. However, determining what statistical screening criteria are most appropriate to support these goals can be daunting. This study describes and demonstrates cost-benefit analysis as an empirical approach to determining appropriate screening…

Descriptors: Test Items, Test Reliability, Evaluation Criteria, Accuracy

Toward Education Quality Improvement in China: A Brief Overview of the National Assessment of Education Quality

Peer reviewed

Direct link

Jiang, Yu; Zhang, Jiahui; Xin, Tao – Journal of Educational and Behavioral Statistics, 2019

This article is an overview of the National Assessment of Education Quality (NAEQ) of China in reading, mathematics, sciences, arts, physical education, and moral education at Grades 4 and 8. After a review of the background and history of NAEQ, we present the assessment framework with students' holistic development at the core and the design for…

Descriptors: Foreign Countries, Educational Quality, Educational Improvement, National Competency Tests

Ability Level Estimation of Students on Probability Unit via Computerized Adaptive Testing

Peer reviewed
PDF on ERIC

Download full text

Özyurt, Hacer; Özyurt, Özcan – Eurasian Journal of Educational Research, 2015

Problem Statement: Learning-teaching activities bring along the need to determine whether they achieve their goals. Thus, multiple choice tests addressing the same set of questions to all are frequently used. However, this traditional assessment and evaluation form contrasts with modern education, where individual learning characteristics are…

Descriptors: Probability, Adaptive Testing, Computer Assisted Testing, Item Response Theory

An Application of Reverse Engineering to Automatic Item Generation: A Proof of Concept Using Automatically Generated Figures

Download full text

Lorié, William A. – Online Submission, 2013

A reverse engineering approach to automatic item generation (AIG) was applied to a figure-based publicly released test item from the Organisation for Economic Cooperation and Development (OECD) Programme for International Student Assessment (PISA) mathematical literacy cognitive instrument as part of a proof of concept. The author created an item…

Descriptors: Numeracy, Mathematical Concepts, Mathematical Logic, Difficulty Level

Comparisons among Small Sample Equating Methods in a Common-Item Design

Peer reviewed

Direct link

Kim, Sooyeon; Livingston, Samuel A. – Journal of Educational Measurement, 2010

Score equating based on small samples of examinees is often inaccurate for the examinee populations. We conducted a series of resampling studies to investigate the accuracy of five methods of equating in a common-item design. The methods were chained equipercentile equating of smoothed distributions, chained linear equating, chained mean equating,…

Descriptors: Equated Scores, Test Items, Item Sampling, Item Response Theory

Commingled Samples: A Neglected Source of Bias in Reliability Analysis

Peer reviewed

Direct link

Waller, Niels G. – Applied Psychological Measurement, 2008

Reliability is a property of test scores from individuals who have been sampled from a well-defined population. Reliability indices, such as coefficient and related formulas for internal consistency reliability (KR-20, Hoyt's reliability), yield lower bound reliability estimates when (a) subjects have been sampled from a single population and when…

Descriptors: Test Items, Reliability, Scores, Psychometrics

Cognitive Rigor: Blending the Strengths of Bloom's Taxonomy and Webb's Depth of Knowledge to Enhance Classroom-Level Processes

Download full text

Hess, Karin K.; Jones, Ben S.; Carlock, Dennis; Walkup, John R. – Online Submission, 2009

To teach the rigorous skills and knowledge students need to succeed in future college-entry courses and workforce training programs, education stakeholders have increasingly called for more rigorous curricula, instruction, and assessments. Identifying the critical attributes of rigor and measuring its appearance in curricular materials is…

Descriptors: Educational Objectives, Classification, Matrices, Curriculum Development

Further Results on the Standard Errors of Estimate Associated with Item-Examinee Sampling Procedures

Peer reviewed

Shoemaker, David M. – Journal of Educational Measurement, 1971

Descriptors: Difficulty Level, Item Sampling, Statistical Analysis, Test Construction

Sampling Knowledge and Understanding: How Long Should a Test Be?

Peer reviewed

Direct link

Burton, Richard F. – Assessment & Evaluation in Higher Education, 2006

Many academic tests (e.g. short-answer and multiple-choice) sample required knowledge with questions scoring 0 or 1 (dichotomous scoring). Few textbooks give useful guidance on the length of test needed to do this reliably. Posey's binomial error model of 1932 provides the best starting point, but allows neither for heterogeneity of question…

Descriptors: Item Sampling, Tests, Test Length, Test Reliability

GENTEST: a Computep Program to Generate Individualized Objective Test Forms.

Peer reviewed

Wasik, John L. – Educational and Psychological Measurement, 1979

A computer program to generate individualized objective test forms for use in a Student Faced Statistics (SPS) course is described. The program features disproportionate sampling from different item domains and enhanced character generation facility for test printing purposes. (Author)

Descriptors: Computer Programs, Individualized Instruction, Item Sampling, Mastery Learning

The Use of Latent Partition Analysis to Identify Homogeneity of an Item Population

Peer reviewed

Hartke, Alan R. – Journal of Educational Measurement, 1978

Latent partition analysis is shown to be useful in determining the conceptual homogeneity of an item population. Such item populations are useful for mastery testing. Applications of latent partition analysis in assessing content validity are suggested. (Author/JKS)

Descriptors: Higher Education, Item Analysis, Item Sampling, Mastery Tests

Item Banking. Basic Testing Series.

Childs, Roy

This pamphlet describes the exciting potential of item banking--a new approach to testing which combines both comparability of scores with flexibility of test format. Item banks are collections of items where the characteristics of each item is known and these characteristics can be summated to described a test made from such items. The principle…

Descriptors: Achievement Tests, Foreign Countries, Item Analysis, Item Banks

Previous Page | Next Page »

Pages: 1 | 2 | 3

Askegaard, Lewis D.	1
Aviani, Ivica	1
Bashkov, Bozhidar M.	1
Bedard, Roger	1
Berger, Martijn P. F.	1
Berk, Ronald A.	1
Boone, William J.	1
Bors, Douglas A.	1
Boyd, Thomas A.	1
Burton, Richard F.	1
Carlock, Dennis	1
Childs, Roy	1
Clauser, Jerome C.	1
Cliff, Norman	1
Curtis, Deborah A.	1
Dennis, J. Richard	1
Derya Çobanoglu Aktan	1
Donoghue, John R.	1
Doron, Rina	1
Douglass, James B.	1
Erceg, Nataša	1
Forster, Fred	1
Forsyth, Robert A.	1
Gifford, Janice A.	1
More ▼