ERIC - Search Results

Showing 1 to 15 of 85 results

Adjustment for Guessing in a Basic Statistics Test for Indonesian Undergraduate Psychology Students Using the Rasch Model

Peer reviewed

Hayat, Bahrul – Cogent Education, 2022

The purpose of this study comprises (1) calibrating the Basic Statistics Test for Indonesian undergraduate psychology students using the Rasch model, (2) testing the impact of adjustment for guessing on item parameters, person parameters, test reliability, and distribution of item difficulty and person ability, and (3) comparing person scores…

Descriptors: Guessing (Tests), Statistics Education, Undergraduate Students, Psychology

Person-Fit Assessment under the D-Scoring Method

Peer reviewed

Dimitrov, Dimiter M.; Atanasov, Dimitar V.; Luo, Yong – Measurement: Interdisciplinary Research and Perspectives, 2020

This study examines and compares four person-fit statistics (PFSs) in the framework of the "D"-scoring method (DSM): (a) van der Flier's "U3" statistic; (b) "Ud" statistic, as a modification of "U3" under the DSM; (c) "Zd" statistic, as a modification of the "Z3 (l_z)"…

Descriptors: Goodness of Fit, Item Analysis, Item Response Theory, Scoring

Identifying Guessing in English Language Tests via Rasch Fit Statistics: An Exploratory Study

Peer reviewed

Coniam, David; Lee, Tony; Lampropoulou, Leda – English Language Teaching, 2021

This article explores the issue of identifying guessers -- with a specific focus on multiple-choice tests. Guessing has long been considered a problem due to the fact that it compromises validity. A test taker scoring higher than they should through guessing does not provide a picture of their actual ability. After an initial description of issues…

Descriptors: Language Tests, Guessing (Tests), English (Second Language), Second Language Learning

An Alternative to the 3PL: Using Asymmetric Item Characteristic Curves to Address Guessing Effects

Peer reviewed

Lee, Sora; Bolt, Daniel M. – Journal of Educational Measurement, 2018

Both the statistical and interpretational shortcomings of the three-parameter logistic (3PL) model in accommodating guessing effects on multiple-choice items are well documented. We consider the use of a residual heteroscedasticity (RH) model as an alternative, and compare its performance to the 3PL with real test data sets and through simulation…

Descriptors: Statistical Analysis, Models, Guessing (Tests), Multiple Choice Tests

Changes in the Speed-Ability Relation through Different Treatments of Rapid Guessing

Peer reviewed

Deribo, Tobias; Goldhammer, Frank; Kroehne, Ulf – Educational and Psychological Measurement, 2023

As researchers in the social sciences, we are often interested in studying not directly observable constructs through assessments and questionnaires. But even in a well-designed and well-implemented study, rapid-guessing behavior may occur. Under rapid-guessing behavior, a task is skimmed shortly but not read and engaged with in-depth. Hence, a…

Descriptors: Reaction Time, Guessing (Tests), Behavior Patterns, Bias

Comparative Analysis of Psychometric Properties of Mathematics Items Constructed by WAEC and NECO in Nigeria Using Item Response Theory Approach

Peer reviewed

Aborisade, Olatunbosun James; Fajobi, Olutoyin Olufunke – Educational Research and Reviews, 2020

West Africa Examination Council (WAEC) and National Examination Council (NECO) are the two major examination bodies saddled with the responsibility of awarding Senior Secondary School Certificate in Nigeria. This study examined the comparability of the psychometric properties of the items constructed by the two examination bodies using Item…

Descriptors: Foreign Countries, Mathematics Tests, Psychometrics, Test Items

Is It Worthy to Take Account of the "Guessing" in the Performance of the Raven Test? Calling for the Principle of Parsimony for Test Validation

Peer reviewed

Lúcio, Patrícia Silva; Vandekerckhove, Joachim; Polanczyk, Guilherme V.; Cogo-Moreira, Hugo – Journal of Psychoeducational Assessment, 2021

The present study compares the fit of two- and three-parameter logistic (2PL and 3PL) models of item response theory in the performance of preschool children on the Raven's Colored Progressive Matrices. The test of Raven is widely used for evaluating nonverbal intelligence of factor g. Studies comparing models with real data are scarce on the…

Descriptors: Guessing (Tests), Item Response Theory, Test Validity, Preschool Children

Comparing the Robustness of Three Nonparametric DIF Procedures to Differential Rapid Guessing

Peer reviewed

Abulela, Mohammed A. A.; Rios, Joseph A. – Applied Measurement in Education, 2022

When there are no personal consequences associated with test performance for examinees, rapid guessing (RG) is a concern and can differ between subgroups. To date, the impact of differential RG on item-level measurement invariance has received minimal attention. To that end, a simulation study was conducted to examine the robustness of the…

Descriptors: Comparative Analysis, Robustness (Statistics), Nonparametric Statistics, Item Analysis

The Predictiveness of PFA Is Improved by Incorporating the Learner's Correct Response Time Fluctuation

Peer reviewed

Chu, Wei; Pavlik, Philip I., Jr. – International Educational Data Mining Society, 2023

In adaptive learning systems, various models are employed to obtain the optimal learning schedule and review for a specific learner. Models of learning are used to estimate the learner's current recall probability by incorporating features or predictors proposed by psychological theory or empirically relevant to learners' performance. Logistic…

Descriptors: Reaction Time, Accuracy, Models, Predictor Variables

Investigating Test Equating Methods in Small Samples through Various Factors

Peer reviewed

Asiret, Semih; Sünbül, Seçil Ömür – Educational Sciences: Theory and Practice, 2016

This study aimed to compare equating methods for random group design using small samples across factors such as sample size, difference in difficulty between forms, and guessing parameter. It also investigated which method gives better results under which conditions. In this study, 5,000 dichotomous simulated data…

Descriptors: Equated Scores, Sample Size, Difficulty Level, Guessing (Tests)

Enhancing the Effectiveness of Concept Inventories Using Textual Analysis: Investigations in an Electrical Engineering Subject

Peer reviewed

Goncher, Andrea M.; Boles, Wageeh – European Journal of Engineering Education, 2019

Concept inventories (CIs) are assessment instruments designed to measure students' conceptual understanding of fundamental concepts in particular fields. CIs utilise multiple-choice questions (MCQs), and specifically designed response selections, to help identify misconceptions. One shortcoming of this assessment instrument is that it fails to…

Descriptors: Engineering Education, Misconceptions, Concept Formation, Evaluation Methods

Subjective Priors for Item Response Models: Application of Elicitation by Design

Peer reviewed

Ames, Allison; Smith, Elizabeth – Journal of Educational Measurement, 2018

Bayesian methods incorporate model parameter information prior to data collection. Eliciting information from content experts is an option, but has seen little implementation in Bayesian item response theory (IRT) modeling. This study aims to use ethical reasoning content experts to elicit prior information and incorporate this information into…

Descriptors: Item Response Theory, Bayesian Statistics, Ethics, Specialists

Same Test, Better Scores: Boosting the Reliability of Short Online Intelligence Recruitment Tests with Nested Logit Item Response Theory Models

Peer reviewed

Storme, Martin; Myszkowski, Nils; Baron, Simon; Bernard, David – Journal of Intelligence, 2019

Assessing job applicants' general mental ability online poses psychometric challenges due to the necessity of having brief but accurate tests. Recent research (Myszkowski & Storme, 2018) suggests that recovering distractor information through Nested Logit Models (NLM; Suh & Bolt, 2010) increases the reliability of ability estimates in…

Descriptors: Intelligence Tests, Item Response Theory, Comparative Analysis, Test Reliability

The Effects of Embedding Knowledge-Check Questions in Instructional Videos

Peer reviewed

Marshall, Francisca B.; Marshall, Justin – Journal of Interactive Learning Research, 2021

The goal of this study was to explore how knowledge-check questions in video lectures affected learning. In a quasi-experimental study, six courses (n=84) were assigned to one of three groups: a control group and two treatment groups. The three groups saw the same video and knowledge-check questions. The three groups were evaluated with different…

Descriptors: Teaching Methods, Video Technology, Knowledge Level, Scores

Reducing the Need for Guesswork in Multiple-Choice Tests

Peer reviewed

Bush, Martin – Assessment & Evaluation in Higher Education, 2015

The humble multiple-choice test is very widely used within education at all levels, but its susceptibility to guesswork makes it a suboptimal assessment tool. The reliability of a multiple-choice test is partly governed by the number of items it contains; however, longer tests are more time consuming to take, and for some subject areas, it can be…

Descriptors: Guessing (Tests), Multiple Choice Tests, Test Format, Test Reliability
