Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 3 |
Since 2016 (last 10 years) | 13 |
Since 2006 (last 20 years) | 40 |
Descriptor
Guessing (Tests) | 97 |
Test Items | 41 |
Multiple Choice Tests | 32 |
Item Response Theory | 25 |
Scores | 17 |
Difficulty Level | 16 |
Mathematical Models | 15 |
Scoring | 14 |
Test Reliability | 14 |
Test Validity | 14 |
Computer Assisted Testing | 13 |
More ▼ |
Source
Author
Wise, Steven L. | 7 |
DeMars, Christine E. | 3 |
van der Linden, Wim J. | 3 |
Albanese, Mark A. | 2 |
Angoff, William H. | 2 |
Hills, John R. | 2 |
Kikas, Eve | 2 |
Mannamaa, Mairi | 2 |
McLean, Stuart | 2 |
Schnipke, Deborah L. | 2 |
Stewart, Jeffrey | 2 |
More ▼ |
Publication Type
Reports - Evaluative | 97 |
Journal Articles | 70 |
Speeches/Meeting Papers | 16 |
Reports - Research | 4 |
Opinion Papers | 2 |
Information Analyses | 1 |
Education Level
Audience
Researchers | 2 |
Laws, Policies, & Programs
Assessments and Surveys
SAT (College Admission Test) | 3 |
ACT Assessment | 1 |
Armed Services Vocational… | 1 |
Facts on Aging Quiz | 1 |
Graduate Management Admission… | 1 |
Preliminary Scholastic… | 1 |
Trends in International… | 1 |
What Works Clearinghouse Rating
Baldwin, Peter – Educational Measurement: Issues and Practice, 2021
In the Bookmark standard-setting procedure, panelists are instructed to consider what examinees know rather than what they might attain by guessing; however, because examinees sometimes do guess, the procedure includes a correction for guessing. Like other corrections for guessing, the Bookmark's correction assumes that examinees either know the…
Descriptors: Guessing (Tests), Student Evaluation, Evaluation Methods, Standard Setting (Scoring)
Raykov, Tenko; Marcoulides, George A. – Educational and Psychological Measurement, 2020
This note raises caution that a finding of a marked pseudo-guessing parameter for an item within a three-parameter item response model could be spurious in a population with substantial unobserved heterogeneity. A numerical example is presented wherein each of two classes the two-parameter logistic model is used to generate the data on a…
Descriptors: Guessing (Tests), Item Response Theory, Test Items, Models
Gustafsson, Martin; Barakat, Bilal Fouad – Comparative Education Review, 2023
International assessments inform education policy debates, yet little is known about their floor effects: To what extent do they fail to differentiate between the lowest performers, and what are the implications of this? TIMSS, SACMEQ, and LLECE data are analyzed to answer this question. In TIMSS, floor effects have been reduced through the…
Descriptors: Achievement Tests, Elementary Secondary Education, International Assessment, Foreign Countries
Stoeckel, Tim; McLean, Stuart; Nation, Paul – Studies in Second Language Acquisition, 2021
Two commonly used test types to assess vocabulary knowledge for the purpose of reading are size and levels tests. This article first reviews several frequently stated purposes of such tests (e.g., materials selection, tracking vocabulary growth) and provides a reasoned argument for the precision needed to serve such purposes. Then three sources of…
Descriptors: Vocabulary Development, Receptive Language, Written Language, Knowledge Level
Cipriani, Giam Pietro – Journal of Economic Education, 2018
A considerable literature in economics and psychology observes substantial gender differences in risk aversion, confidence, and responses to high pressure. In the educational measurement literature, it has been argued that these differences could disadvantage female students when taking multiple-choice tests, especially if there is a penalty for…
Descriptors: Gender Differences, Guessing (Tests), Academic Failure, Multiple Choice Tests
Drabinová, Adéla; Martinková, Patrícia – Journal of Educational Measurement, 2017
In this article we present a general approach not relying on item response theory models (non-IRT) to detect differential item functioning (DIF) in dichotomous items with presence of guessing. The proposed nonlinear regression (NLR) procedure for DIF detection is an extension of method based on logistic regression. As a non-IRT approach, NLR can…
Descriptors: Test Items, Regression (Statistics), Guessing (Tests), Identification
Stewart, Jeffrey; McLean, Stuart; Kramer, Brandon – Language Assessment Quarterly, 2017
Stewart questioned vocabulary size estimation methods proposed by Beglar and Nation for the Vocabulary Size Test, further arguing Rasch mean square (MSQ) fit statistics cannot determine the proportion of random guesses contained in the average learner's raw score, because the average value will be near 1 by design. He illustrated this by…
Descriptors: Guessing (Tests), Item Response Theory, Language Tests, Vocabulary
Smith, Ben O.; Wagner, Jamie – Journal of Economic Education, 2018
In 2016, Walstad and Wagner developed a procedure to split pre-test and post-test responses into four learning types: positive, negative, retained, and zero learning. This disaggregation is not only useful in academic studies; but also provides valuable insight to the practitioner: an instructor would take different mitigating actions in response…
Descriptors: Pretests Posttests, Value Added Models, Guessing (Tests), Monte Carlo Methods
Wise, Steven L. – Educational Measurement: Issues and Practice, 2017
The rise of computer-based testing has brought with it the capability to measure more aspects of a test event than simply the answers selected or constructed by the test taker. One behavior that has drawn much research interest is the time test takers spend responding to individual multiple-choice items. In particular, very short response…
Descriptors: Guessing (Tests), Multiple Choice Tests, Test Items, Reaction Time
Wise, Steven L.; Kingsbury, G. Gage – Journal of Educational Measurement, 2016
This study examined the utility of response time-based analyses in understanding the behavior of unmotivated test takers. For the data from an adaptive achievement test, patterns of observed rapid-guessing behavior and item response accuracy were compared to the behavior expected under several types of models that have been proposed to represent…
Descriptors: Achievement Tests, Student Motivation, Test Wiseness, Adaptive Testing
Allen, Jeff M.; Mattern, Krista – ACT, Inc., 2019
States and districts have expressed interest in administering the ACT® to 10th-grade students. Given that the ACT was designed to be administered in the spring of 11th grade or fall of 12th grade, the appropriateness of this use should be evaluated. As such, the focus of this paper is to summarize empirical evidence evaluating the use of the ACT…
Descriptors: Test Validity, College Entrance Examinations, High School Students, Grade 10
Komatsu, Kotaro; Tsujiyama, Yosuke; Sakamaki, Aruta; Koike, Norio – For the Learning of Mathematics, 2014
It has become gradually accepted that proof and proving are essential at all grades of mathematical learning. Among the various aspects of proof and proving, this study addresses proofs and refutations described by Lakatos, in particular a part of increasing content by deductive guessing, to introduce an authentic process into mathematics…
Descriptors: Mathematics Instruction, Validity, Mathematical Logic, Guessing (Tests)
Walstad, William B.; Rebeck, Ken – Journal of Economic Education, 2017
The "Test of Financial Literacy" (TFL) was created to measure the financial knowledge of high school students. Its content is based on the standards and benchmarks stated in the "National Standards for Financial Literacy" (Council for Economic Education 2013). The test development process involved extensive item writing and…
Descriptors: Tests, Money Management, Literacy, High School Students
McQuillan, Jeff; Ediger, Warren – Reading Matrix: An International Online Journal, 2018
There is considerable evidence that incidental vocabulary acquisition through reading accounts for a large portion of the growth in word knowledge for both first (L1) and second (L2) language acquirers. In this paper, we evaluate the Markov Estimate of Semantic Association (MESA) technique for detecting small, incremental gains in vocabulary…
Descriptors: Markov Processes, Vocabulary Development, Incidental Learning, Native Language
Novacek, Paul – International Association for Development of the Information Society, 2013
Traditional knowledge assessments rely on multiple-choice type questions that only report a right or wrong answer. The reliance within the education system on this technique infers that a student who provides a correct answer purely through guesswork possesses knowledge equivalent to a student who actually knows the correct answer. A more complete…
Descriptors: Adult Learning, Multiple Choice Tests, Guessing (Tests), Confidence Testing