Showing 1 to 15 of 16 results
Peer reviewed
Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2022
The reliability of a test score is usually underestimated, and the deflation may be profound: 0.40-0.60 units of reliability, or 46-71%. Eight root sources of the deflation are discussed and quantified by a simulation with 1,440 real-world datasets: (1) errors in the measurement modelling, (2) inefficiency in the estimator of reliability within…
Descriptors: Test Reliability, Scores, Test Items, Correlation
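The deflation at issue concerns estimators of reliability such as coefficient alpha, which is known to underestimate true reliability under violated assumptions. Purely for orientation, a minimal Python sketch of coefficient alpha on simulated binary item data; the data and names below are illustrative, not drawn from the article's 1,440 datasets:

    import numpy as np

    def coefficient_alpha(X):
        # Cronbach's alpha for a persons-by-items score matrix X.
        X = np.asarray(X, dtype=float)
        k = X.shape[1]                          # number of items
        item_var = X.var(axis=0, ddof=1).sum()  # sum of item variances
        total_var = X.sum(axis=1).var(ddof=1)   # variance of total scores
        return (k / (k - 1)) * (1 - item_var / total_var)

    # Simulated data: 200 examinees, 10 binary items driven by one ability.
    rng = np.random.default_rng(0)
    theta = rng.normal(size=(200, 1))
    X = (theta + rng.normal(size=(200, 10)) > 0).astype(int)
    print(round(coefficient_alpha(X), 3))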
Peer reviewed
Metsämuuronen, Jari – International Journal of Educational Methodology, 2021
Although Goodman-Kruskal gamma (G) is used relatively rarely, it has promising potential as a coefficient of association in educational settings. Characteristics of G are studied in three sub-studies related to educational measurement settings. G appears to be unexpectedly appealing as an estimator of association between an item and a score because…
Descriptors: Educational Assessment, Measurement, Item Analysis, Correlation
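Goodman-Kruskal gamma is defined over pairs of observations: G = (C - D) / (C + D), where C and D count concordant and discordant pairs and tied pairs are ignored. A minimal, illustrative Python sketch of G between a binary item and a total score; the data are invented:

    import numpy as np

    def goodman_kruskal_gamma(x, y):
        # G = (C - D) / (C + D) over all pairs; ties contribute to neither.
        x, y = np.asarray(x), np.asarray(y)
        C = D = 0
        for i in range(len(x)):
            for j in range(i + 1, len(x)):
                s = (x[i] - x[j]) * (y[i] - y[j])
                if s > 0:
                    C += 1          # concordant pair
                elif s < 0:
                    D += 1          # discordant pair
        return (C - D) / (C + D)

    item = [0, 0, 1, 0, 1, 1, 1, 0, 1, 1]       # binary item responses
    score = [3, 5, 8, 4, 9, 7, 10, 2, 6, 9]     # total test scores
    print(round(goodman_kruskal_gamma(item, score), 3))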
Peer reviewed
Johnson, Matthew S.; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2020
One common score reported from diagnostic classification assessments is the vector of posterior means of the skill mastery indicators. As with any assessment, it is important to derive and report estimates of the reliability of the reported scores. After reviewing a reliability measure suggested by Templin and Bradshaw, this article suggests three…
Descriptors: Reliability, Probability, Skill Development, Classification
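In a diagnostic classification model with K skills, the reported score vector collects, for each skill, the posterior probability of mastery: the posterior mass on attribute profiles in which that skill equals 1. A small illustrative sketch with a made-up posterior over the 2^K profiles of one examinee (the numbers are invented, not from the article):

    import itertools
    import numpy as np

    K = 3
    profiles = list(itertools.product([0, 1], repeat=K))   # all 2^K profiles
    # Hypothetical posterior over the 8 profiles for one examinee.
    posterior = np.array([0.05, 0.05, 0.10, 0.05, 0.10, 0.15, 0.10, 0.40])
    posterior /= posterior.sum()

    # P(alpha_k = 1 | data): posterior mass on profiles with skill k mastered.
    post_means = np.array([
        sum(p for prof, p in zip(profiles, posterior) if prof[k] == 1)
        for k in range(K)
    ])
    print(post_means.round(3))   # the reported score vector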
Peer reviewed
Kárász, Judit T.; Széll, Krisztián; Takács, Szabolcs – Quality Assurance in Education: An International Perspective, 2023
Purpose: Based on the general formula, which depends on the length and difficulty of the test, the number of respondents, and the number of ability levels, this study aims to provide a closed formula for adaptive tests of medium difficulty (probability of solution p = 1/2) to determine the accuracy of the parameters for each item and in…
Descriptors: Test Length, Probability, Comparative Analysis, Difficulty Level
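The snippet does not reproduce the closed formula itself, but the special role of p = 1/2 is visible already in the elementary standard error of an observed proportion, sqrt(p(1 - p)/n), which is maximal at p = 1/2, where it reduces to 1/(2*sqrt(n)). A short numeric check, offered only as background to the abstract:

    import math

    def prop_se(p, n):
        # Standard error of an observed proportion from n responses.
        return math.sqrt(p * (1 - p) / n)

    n = 40
    print(prop_se(0.5, n))          # maximal at p = 1/2 ...
    print(1 / (2 * math.sqrt(n)))   # ... where it equals 1/(2*sqrt(n))
    print(prop_se(0.8, n))          # smaller away from p = 1/2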
Peer reviewed
Smith, Trevor I.; Bendjilali, Nasrine – Physical Review Physics Education Research, 2022
Several recent studies have employed item response theory (IRT) to rank incorrect responses to commonly used research-based multiple-choice assessments. These studies use Bock's nominal response model (NRM) for applying IRT to categorical (nondichotomous) data, but the response rankings utilize only half of the parameters estimated by the model.…
Descriptors: Item Response Theory, Test Items, Multiple Choice Tests, Science Tests
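Bock's nominal response model gives each response category k of an item a slope a_k and an intercept c_k, with P(k | theta) proportional to exp(a_k*theta + c_k); the abstract's "half of the parameters" presumably refers to rankings that use one of these two parameter sets (an assumption on my part). A minimal sketch with invented parameters for a four-option item:

    import numpy as np

    def nrm_probs(theta, a, c):
        # Bock's NRM: category probabilities are a softmax of a_k*theta + c_k.
        z = np.asarray(a) * theta + np.asarray(c)
        ez = np.exp(z - z.max())    # subtract max for numerical stability
        return ez / ez.sum()

    a = [1.2, -0.4, 0.1, -0.9]      # slopes (invented)
    c = [0.5, 0.2, -0.1, -0.6]      # intercepts (invented)
    for theta in (-2.0, 0.0, 2.0):
        print(theta, nrm_probs(theta, a, c).round(3))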
Peer reviewed
Wang, Chao; Lu, Hong – Educational Technology & Society, 2018
This study focused on the effect of examinees' ability levels on the relationship between Reflective-Impulsive (RI) cognitive style and item response time in computerized adaptive testing (CAT). A total of 56 students majoring in Educational Technology at Shandong Normal University participated in this study, and their RI cognitive styles were…
Descriptors: Item Response Theory, Computer Assisted Testing, Cognitive Style, Correlation
Peer reviewed
DeMars, Christine E. – Educational and Psychological Measurement, 2016
Partially compensatory models may capture the cognitive skills needed to answer test items more realistically than compensatory models, but estimating the model parameters may be a challenge. Data were simulated to follow two different partially compensatory models, a model with an interaction term and a product model. The model parameters were…
Descriptors: Item Response Theory, Models, Thinking Skills, Test Items
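The contrast the abstract draws can be made concrete with the standard textbook forms: in a compensatory model the two abilities enter a single logit, so strength in one skill can offset weakness in the other, while in a product (partially compensatory) model the item effectively requires both skills. A sketch under those standard parameterizations, which may differ in detail from the models simulated in the article:

    import numpy as np

    def compensatory(t1, t2, a1=1.0, a2=1.0, d=0.0):
        # Abilities add inside one logit, so the skills can trade off.
        return 1 / (1 + np.exp(-(a1 * t1 + a2 * t2 + d)))

    def product_model(t1, t2, a1=1.0, b1=0.0, a2=1.0, b2=0.0):
        # Success probability is a product of per-skill logistic terms,
        # so a deficit in either skill caps the probability.
        p1 = 1 / (1 + np.exp(-a1 * (t1 - b1)))
        p2 = 1 / (1 + np.exp(-a2 * (t2 - b2)))
        return p1 * p2

    # High theta1 offsets low theta2 only in the compensatory model.
    print(round(compensatory(2.0, -2.0), 3))    # 0.5
    print(round(product_model(2.0, -2.0), 3))   # ~0.105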
Peer reviewed
Rakes, Christopher R.; Ronau, Robert N. – International Journal of Research in Education and Science, 2019
The present study examined the ability of content domain (algebra, geometry, rational number, probability) to classify mathematics misconceptions. The study was conducted with 1,133 students in 53 algebra and geometry classes taught by 17 teachers from three high schools and one middle school across three school districts in a Midwestern state.…
Descriptors: Mathematics Instruction, Secondary School Teachers, Middle School Teachers, Misconceptions
Peer reviewed
Çokluk, Ömay; Gül, Emrah; Dogan-Gül, Çilem – Educational Sciences: Theory and Practice, 2016
The study aims to examine whether differential item functioning appears across three test forms whose items are ordered randomly or sequentially (easy-to-hard and hard-to-easy), based on Classical Test Theory (CTT) and Item Response Theory (IRT) methods and bearing item difficulty levels in mind. In the correlational research, the…
Descriptors: Test Bias, Test Items, Difficulty Level, Test Theory
Peer reviewed
Andjelic, Svetlana; Cekerevac, Zoran – Education and Information Technologies, 2014
This article presents an original model of computerized adaptive testing and grade formation, based on scientifically recognized theories. The basis of the model is a personalized algorithm for selecting questions depending on the correctness of the answer to the previous question. The test is divided into three basic levels of difficulty, and the…
Descriptors: Computer Assisted Testing, Educational Technology, Grades (Scholastic), Test Construction
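The selection rule described, where the next question depends on whether the previous answer was correct and the pool has three difficulty levels, can be sketched as a simple up/down rule. Everything below (the pools, starting level, and fixed-length stopping rule) is invented scaffolding, not the article's actual algorithm:

    import random

    pools = {0: ["easy Q1", "easy Q2"],
             1: ["medium Q1", "medium Q2"],
             2: ["hard Q1", "hard Q2"]}

    def run_test(n_items, answer):
        level = 1                                    # start at medium
        for _ in range(n_items):
            q = random.choice(pools[level])
            correct = answer(q, level)
            # Move up after a correct answer, down after an incorrect one.
            level = min(level + 1, 2) if correct else max(level - 1, 0)
            yield q, correct, level

    # Simulated examinee who answers correctly 60% of the time.
    for step in run_test(5, lambda q, lvl: random.random() < 0.6):
        print(step)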
Peer reviewed
Çetin, Sevda; Gelbal, Selahattin – Educational Sciences: Theory and Practice, 2013
In this research, the cut score of a foundation university was recalculated with the Bookmark method and the Angoff method, each a standard-setting method, and the cut scores found were compared with the current proficiency score. The final cut score was found to be 27.87 through the cooperative work of 17 experts using the Angoff…
Descriptors: Standard Setting (Scoring), Comparative Analysis, Cutting Scores, Correlation
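In the Angoff method, each expert judges, item by item, the probability that a minimally competent examinee answers correctly; an expert's implied cut score is the sum of those judgments over items, and the panel cut score is typically their mean. A sketch with simulated ratings (the study's 27.87 came from its actual 17-expert panel, not from data like these):

    import numpy as np

    # 17 experts x 40 items of judged success probabilities (simulated).
    rng = np.random.default_rng(1)
    ratings = rng.uniform(0.4, 0.9, size=(17, 40))

    expert_cuts = ratings.sum(axis=1)   # each expert's implied cut score
    cut_score = expert_cuts.mean()      # panel cut score: mean over experts
    print(round(cut_score, 2))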
Peer reviewed
Yildirim, Huseyin H.; Yildirim, Selda – Hacettepe University Journal of Education, 2011
Multivariate matching in Differential Item Functioning (DIF) analyses may contribute to understanding the sources of DIF. In this context, detecting appropriate additional matching variables is a crucial issue. The present article argues that the variables which are correlated with communalities in item difficulties can be used as an additional…
Descriptors: Test Bias, Multivariate Analysis, Probability, Regression (Statistics)
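Multivariate matching in DIF analysis means stratifying examinees not only on the primary matching variable (usually total score) but also on an additional variable before comparing groups. A minimal Mantel-Haenszel sketch, with strata formed from a score band plus a second, hypothetical matching variable; the data are simulated and the setup is not the article's own analysis:

    import numpy as np
    from collections import defaultdict

    def mh_odds_ratio(correct, group, strata):
        # Mantel-Haenszel common odds ratio across matching strata.
        # group: 0 = reference, 1 = focal; correct: 0/1 item response.
        tables = defaultdict(lambda: np.zeros((2, 2)))
        for c, g, s in zip(correct, group, strata):
            tables[s][g, c] += 1
        num = den = 0.0
        for t in tables.values():
            n = t.sum()
            num += t[0, 1] * t[1, 0] / n   # ref correct * focal incorrect
            den += t[0, 0] * t[1, 1] / n   # ref incorrect * focal correct
        return num / den

    rng = np.random.default_rng(2)
    n = 500
    group = rng.integers(0, 2, n)
    band = rng.integers(0, 5, n)      # total-score band (primary match)
    extra = rng.integers(0, 2, n)     # additional matching variable
    correct = (rng.random(n) < 0.2 + 0.15 * band).astype(int)
    strata = list(zip(band, extra))   # multivariate matching strata
    print(round(mh_odds_ratio(correct, group, strata), 2))   # ~1: no DIF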
Ives, Sarah Elizabeth – ProQuest LLC, 2009
The purposes of this study were to investigate preservice mathematics teachers' orientations, content knowledge, and pedagogical content knowledge of probability; the relationships among these three aspects; and the usefulness of tasks with respect to examining these aspects of knowledge. The design of the study was a multi-case study of five…
Descriptors: Preservice Teachers, Test Items, Mathematics Teachers, Probability
Peer reviewed
Bielinski, John; Davison, Mark L. – Journal of Educational Measurement, 2001
Used mathematics achievement data from the 1992 National Assessment of Educational Progress, the Third International Mathematics and Science Study, and the National Education Longitudinal Study of 1988 to examine the sex difference by item difficulty interaction. The predicted negative correlation was found for all eight populations and was…
Descriptors: Correlation, Difficulty Level, Interaction, Mathematics Tests
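The statistic at issue pairs, for each item, its difficulty with the size of the sex difference on that item, and correlates the two across items. A sketch on simulated values; the negative slope is built in only to mirror the direction the abstract reports, and the exact definitions (direction of the difference, difficulty metric) are assumptions here:

    import numpy as np

    rng = np.random.default_rng(3)
    n_items = 30
    difficulty = np.sort(rng.normal(size=n_items))   # IRT-style b values
    # d_i: per-item sex difference, simulated to shrink as items get harder.
    sex_diff = -0.02 * difficulty + rng.normal(scale=0.02, size=n_items)

    r = np.corrcoef(difficulty, sex_diff)[0, 1]
    print(round(r, 2))   # negative, echoing the reported interaction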
Peer reviewed
Mareschal, Denis; Powell, Daisy; Westermann, Gert; Volein, Agnes – Infant and Child Development, 2005
Young infants are very sensitive to feature distribution information in the environment. However, existing work suggests that they do not make use of correlation information to form certain perceptual categories until at least 7 months of age. We suggest that the failure to use correlation information is a by-product of familiarization procedures…
Descriptors: Infants, Classification, Correlation, Familiarity