Brysbaert, Marc – Cognitive Research: Principles and Implications, 2024
Experimental psychology is witnessing an increase in research on individual differences, which requires the development of new tasks that can reliably assess variations among participants. To do this, cognitive researchers need statistical methods that many researchers have not learned during their training. The lack of expertise can pose…
Descriptors: Experimental Psychology, Individual Differences, Statistical Analysis, Task Analysis
Jiang, Yu; Zhang, Jiahui; Xin, Tao – Journal of Educational and Behavioral Statistics, 2019
This article is an overview of the National Assessment of Education Quality (NAEQ) of China in reading, mathematics, sciences, arts, physical education, and moral education at Grades 4 and 8. After a review of the background and history of NAEQ, we present the assessment framework with students' holistic development at the core and the design for…
Descriptors: Foreign Countries, Educational Quality, Educational Improvement, National Competency Tests
Vie, Jill-Jênn; Popineau, Fabrice; Bruillard, Éric; Bourda, Yolaine – International Journal of Artificial Intelligence in Education, 2018
In large-scale assessments such as the ones encountered in MOOCs, a lot of usage data is available because of the number of learners involved. Newcomers who have just arrived on a MOOC have various backgrounds in terms of knowledge, but the platform knows hardly anything about them. Therefore, it is crucial to elicit their knowledge fast, in order to…
Descriptors: Automation, Test Construction, Measurement, Online Courses
OECD Publishing, 2014
The "PISA 2012 Technical Report" describes the methodology underlying the PISA 2012 survey, which tested 15-year-olds' competencies in mathematics, reading and science and, in some countries, problem solving and financial literacy. It examines the design and implementation of the project at a level of detail that allows researchers to…
Descriptors: International Assessment, Secondary School Students, Foreign Countries, Achievement Tests
Classification Consistency and Accuracy for Complex Assessments under the Compound Multinomial Model
Lee, Won-Chan; Brennan, Robert L.; Wan, Lei – Applied Psychological Measurement, 2009
For a test that consists of dichotomously scored items, several approaches have been reported in the literature for estimating classification consistency and accuracy indices based on a single administration of a test. Classification consistency and accuracy have not been studied much, however, for "complex" assessments--for example,…
Descriptors: Classification, Reliability, Test Items, Scoring
Ratcliff, Roger; Starns, Jeffrey J. – Psychological Review, 2009
A new model for confidence judgments in recognition memory is presented. In the model, the match between a single test item and memory produces a distribution of evidence, with better matches corresponding to distributions with higher means. On this match dimension, confidence criteria are placed, and the areas between the criteria under the…
Descriptors: Recognition (Psychology), Models, Test Items, Reaction Time
Hess, Karin K.; Jones, Ben S.; Carlock, Dennis; Walkup, John R. – Online Submission, 2009
To teach the rigorous skills and knowledge students need to succeed in future college-entry courses and workforce training programs, education stakeholders have increasingly called for more rigorous curricula, instruction, and assessments. Identifying the critical attributes of rigor and measuring its appearance in curricular materials is…
Descriptors: Educational Objectives, Classification, Matrices, Curriculum Development
OECD Publishing (NJ1), 2009
The Organisation for Economic Cooperation and Development's (OECD's) Programme for International Student Assessment (PISA) surveys, which take place every three years, have been designed to collect information about 15-year-old students in participating countries. PISA examines how well students are prepared to meet the challenges of the future,…
Descriptors: Policy Formation, Scaling, Academic Achievement, Interrater Reliability

Wasik, John L. – Educational and Psychological Measurement, 1979
A computer program to generate individualized objective test forms for use in a Student Paced Statistics (SPS) course is described. The program features disproportionate sampling from different item domains and an enhanced character generation facility for test printing purposes. (Author)
Descriptors: Computer Programs, Individualized Instruction, Item Sampling, Mastery Learning
Peay, Edmund R. – 1982
The method for questionnaire construction described in this paper makes it convenient to generate as many different forms for a questionnaire as there are respondents. The method is based on using the computer to produce the questionnaire forms themselves. In this way the items or subgroups of items of the questionnaire may be randomly ordered or…
Descriptors: Computer Assisted Testing, Computer Software, Questionnaires, Sampling
Huitzing, Hiddo A. – Applied Psychological Measurement, 2004
This article shows how set covering with item sampling (SCIS) methods can be used in the analysis and preanalysis of linear programming models for test assembly (LPTA). LPTA models can construct tests, fulfilling a set of constraints set by the test assembler. Sometimes, no solution to the LPTA model exists. The model is then said to be…
Descriptors: Mathematical Applications, Simulation, Item Sampling, Item Response Theory
Revuelta, Javier – Psychometrika, 2004
Two psychometric models are presented for evaluating the difficulty of the distractors in multiple-choice items. They are based on the criterion of rising distractor selection ratios, which facilitates interpretation of the subject and item parameters. Statistical inferential tools are developed in a Bayesian framework: modal a posteriori…
Descriptors: Multiple Choice Tests, Psychometrics, Models, Difficulty Level
Cook, Linda L.; Petersen, Nancy S. – 1986
This paper examines how various equating methods are affected by: (1) sampling error; (2) sample characteristics; and (3) characteristics of anchor test items. It reviews empirical studies that investigated the invariance of equating transformations, and it discusses empirical and simulation studies that focus on how the properties of anchor tests…
Descriptors: Educational Research, Equated Scores, Error of Measurement, Evaluation Methods
Schultz, Matthew T.; Geisinger, Kurt F. – 1992
Research efforts have established that the Mantel-Haenszel procedure (MHP) is an effective method for detecting the presence of test items exhibiting differential item functioning (DIF). While the MHP has been advocated for situations where item response theory based methods may not be usable, recent findings have suggested that the performance of…
Descriptors: College Entrance Examinations, Comparative Analysis, Control Groups, Equations (Mathematics)
Grube, Joel W.; Keefe, Deborah B.; Stewart, Kathryn – Pacific Institute for Research and Evaluation, 2002
People who care about young people are aware of the serious problems caused by underage alcohol use. They should also be aware that there are many effective strategies for reducing underage drinking, and every state and community should be using these strategies. The "Guide to Conducting Youth Surveys" provides information about one important tool…
Descriptors: Substance Abuse, Drinking, Drug Use, Confidentiality