ERIC - Search Results

Publication Date

In 2025	1
Since 2024	4
Since 2021 (last 5 years)	22
Since 2016 (last 10 years)	88
Since 2006 (last 20 years)	158

Descriptor

Correlation	188
Test Items	188
Test Reliability	120
Test Validity	71
Foreign Countries	66
Scores	57
Factor Analysis	55
Reliability	55
Test Construction	52
Statistical Analysis	47
Psychometrics	46
Item Analysis	37
Difficulty Level	36
Item Response Theory	32
Comparative Analysis	28
Measures (Individuals)	24
College Students	23
Construct Validity	21
Undergraduate Students	21
Factor Structure	20
Likert Scales	20
Interrater Reliability	19
Validity	19
Goodness of Fit	17
Scoring	17
More ▼

Publication Type

Reports - Research	148
Journal Articles	134
Tests/Questionnaires	23
Reports - Evaluative	20
Speeches/Meeting Papers	13
Dissertations/Theses -…	9
Numerical/Quantitative Data	6
Reports - Descriptive	5
Guides - General	1
Guides - Non-Classroom	1
Information Analyses	1
Multilingual/Bilingual…	1
Non-Print Media	1
Opinion Papers	1
Reference Materials - General	1
More ▼

Education Level

Higher Education	66
Postsecondary Education	49
Elementary Education	17
Secondary Education	17
Middle Schools	9
High Schools	8
Early Childhood Education	7
Junior High Schools	7
Elementary Secondary Education	6
Grade 8	5
Primary Education	5
Grade 7	4
Intermediate Grades	4
Grade 2	3
Grade 5	3
Grade 6	3
Kindergarten	3
Adult Education	2
Grade 1	2
Grade 3	2
Grade 4	2
Grade 9	2
Two Year Colleges	2
Preschool Education	1
More ▼

Audience

Researchers	3
Practitioners	1
Teachers	1

Location

Turkey	20
California	4
Canada	4
Germany	4
New York	4
China	3
Florida	3
India	3
Australia	2
Illinois	2
Japan	2
Taiwan	2
United Kingdom	2
United Kingdom (England)	2
United Kingdom (London)	2
Vietnam	2
Arizona	1
Chile	1
Colombia	1
District of Columbia	1
Egypt	1
Greece	1
Indonesia	1
Iowa	1
Iran	1
More ▼

Laws, Policies, & Programs

United Nations Convention on…

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	1
Meets WWC Standards with or without Reservations	1

Showing 1 to 15 of 188 results Save | Export

A Comparison of Yen's Q3 Coefficient and Rasch Testlet Modeling for Identifying Local Item Dependence: Evidence from Two Vocabulary Matching Tests

Peer reviewed

Direct link

Hung Tan Ha; Duyen Thi Bich Nguyen; Tim Stoeckel – Language Assessment Quarterly, 2025

This article compares two methods for detecting local item dependence (LID): residual correlation examination and Rasch testlet modeling (RTM), in a commonly used 3:6 matching format and an extended matching test (EMT) format. The two formats are hypothesized to facilitate different levels of item dependency due to differences in the number of…

Descriptors: Comparative Analysis, Language Tests, Test Items, Item Analysis

How to Obtain the Most Error-Free Estimate of Reliability? Eight Sources of Deflation in the Estimates of Reliability to Avoid

Peer reviewed
PDF on ERIC

Download full text

Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2022

The reliability of a test score is usually underestimated and the deflation may be profound, 0.40 - 0.60 units of reliability or 46 - 71%. Eight root sources of the deflation are discussed and quantified by a simulation with 1,440 real-world datasets: (1) errors in the measurement modelling, (2) inefficiency in the estimator of reliability within…

Descriptors: Test Reliability, Scores, Test Items, Correlation

There Are Many Greater Lower Bounds than Cronbach's [alpha]: A Monte Carlo Simulation Study

Peer reviewed

Direct link

Novak, Josip; Rebernjak, Blaž – Measurement: Interdisciplinary Research and Perspectives, 2023

A Monte Carlo simulation study was conducted to examine the performance of [alpha], [lambda]2, [lambda][subscript 4], [lambda][subscript 2], [omega][subscript T], GLB[subscript MRFA], and GLB[subscript Algebraic] coefficients. Population reliability, distribution shape, sample size, test length, and number of response categories were varied…

Descriptors: Monte Carlo Methods, Evaluation Methods, Reliability, Simulation

To What Extent Are Item Discrimination Values Realistic? A New Index for Two-Dimensional Structures

Peer reviewed
PDF on ERIC

Download full text

Kilic, Abdullah Faruk; Uysal, Ibrahim – International Journal of Assessment Tools in Education, 2022

Most researchers investigate the corrected item-total correlation of items when analyzing item discrimination in multi-dimensional structures under the Classical Test Theory, which might lead to underestimating item discrimination, thereby removing items from the test. Researchers might investigate the corrected item-total correlation with the…

Descriptors: Item Analysis, Correlation, Item Response Theory, Test Items

Estimating Difference-Score Reliability in Pretest-Posttest Settings

Peer reviewed

Direct link

Gu, Zhengguo; Emons, Wilco H. M.; Sijtsma, Klaas – Journal of Educational and Behavioral Statistics, 2021

Clinical, medical, and health psychologists use difference scores obtained from pretest--posttest designs employing the same test to assess intraindividual change possibly caused by an intervention addressing, for example, anxiety, depression, eating disorder, or addiction. Reliability of difference scores is important for interpreting observed…

Descriptors: Test Reliability, Scores, Pretests Posttests, Computation

A Comparison of Latent Semantic Analysis and Latent Dirichlet Allocation in Educational Measurement

Peer reviewed

Direct link

Jordan M. Wheeler; Allan S. Cohen; Shiyu Wang – Journal of Educational and Behavioral Statistics, 2024

Topic models are mathematical and statistical models used to analyze textual data. The objective of topic models is to gain information about the latent semantic space of a set of related textual data. The semantic space of a set of textual data contains the relationship between documents and words and how they are used. Topic models are becoming…

Descriptors: Semantics, Educational Assessment, Evaluators, Reliability

The Role of Item Distributions on Reliability Estimation: The Case of Cronbach's Coefficient Alpha

Peer reviewed

Direct link

Olvera Astivia, Oscar Lorenzo; Kroc, Edward; Zumbo, Bruno D. – Educational and Psychological Measurement, 2020

Simulations concerning the distributional assumptions of coefficient alpha are contradictory. To provide a more principled theoretical framework, this article relies on the Fréchet-Hoeffding bounds, in order to showcase that the distribution of the items play a role on the estimation of correlations and covariances. More specifically, these bounds…

Descriptors: Test Items, Test Reliability, Computation, Correlation

The Reliability of the Posterior Probability of Skill Attainment in Diagnostic Classification Models

Peer reviewed

Direct link

Johnson, Matthew S.; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2020

One common score reported from diagnostic classification assessments is the vector of posterior means of the skill mastery indicators. As with any assessment, it is important to derive and report estimates of the reliability of the reported scores. After reviewing a reliability measure suggested by Templin and Bradshaw, this article suggests three…

Descriptors: Reliability, Probability, Skill Development, Classification

Comparison of Cronbach's Alpha and McDonald's Omega for Ordinal Data: Are They Different?

Peer reviewed
PDF on ERIC

Download full text

Fatih Orcan – International Journal of Assessment Tools in Education, 2023

Among all, Cronbach's Alpha and McDonald's Omega are commonly used for reliability estimations. The alpha uses inter-item correlations while omega is based on a factor analysis result. This study uses simulated ordinal data sets to test whether the alpha and omega produce different estimates. Their performances were compared according to the…

Descriptors: Statistical Analysis, Monte Carlo Methods, Correlation, Factor Analysis

Development and Initial Validation of Digital Age Teaching Scale (DATS) to Assess Application of ISTE Standards for Educators in K-12 Education Classrooms

Peer reviewed

Direct link

Vucaj, Indrit – Journal of Research on Technology in Education, 2022

This study presents the methodological and procedural development process of the Digital Age Teaching Scale (DATS), a summative assessment tool designed to measure application of the ISTE Standards for Educators in K-12 classrooms. The theoretical framework of the ISTE Standards for Educators informed the development of DATS, and an 8-step process…

Descriptors: Elementary Secondary Education, Standards, Test Construction, Test Items

Modeling Local Item Dependence in Cloze Tests with the Rasch Model: Applying a New Strategy

Peer reviewed
PDF on ERIC

Download full text

Barno S. Abdullaeva; Diyorjon Abdullaev; Nurislom I. Khursanov; Khurshida B. Kadirova; Laylo Djuraeva – International Journal of Language Testing, 2024

Cloze tests are commonly used in language testing as a quick measure of overall language ability or reading comprehension. A problem for the analysis of cloze tests with item response theory models is that cloze test items are locally dependent. This leads to the violation of the conditional or local independence assumption of IRT models. In this…

Descriptors: Cloze Procedure, Language Tests, Test Items, Correlation

Somers' D as an Alternative for the Item-Test and Item-Rest Correlation Coefficients in the Educational Measurement Settings

Peer reviewed
PDF on ERIC

Download full text

Metsämuuronen, Jari – International Journal of Educational Methodology, 2020

Pearson product-moment correlation coefficient between item g and test score X, known as item-test or item-total correlation ("Rit"), and item-rest correlation ("Rir") are two of the most used classical estimators for item discrimination power (IDP). Both "Rit" and "Rir" underestimate IDP caused by the…

Descriptors: Correlation, Test Items, Scores, Difficulty Level

The Concurrent Validity of Comparative Judgement Outcomes Compared with Marks

Download full text

Gill, Tim – Research Matters, 2022

In Comparative Judgement (CJ) exercises, examiners are asked to look at a selection of candidate scripts (with marks removed) and order them in terms of which they believe display the best quality. By including scripts from different examination sessions, the results of these exercises can be used to help with maintaining standards. Results from…

Descriptors: Comparative Analysis, Decision Making, Scripts, Standards

Estimating the Impact of Local Item Dependency in a Test of Second Language Reading Comprehension

Peer reviewed
PDF on ERIC

Download full text

Tim Stoeckel; Liang Ye Tan; Hung Tan Ha; Nam Thi Phuong Ho; Tomoko Ishii; Young Ae Kim; Chunmei Huang; Stuart McLean – Vocabulary Learning and Instruction, 2024

Local item dependency (LID) occurs when test-takers' responses to one test item are affected by their responses to another. It can be problematic if it causes inflated reliability estimates or distorted person and item measures. The cued-recall reading comprehension test in Hu and Nation's (2000) well-known and influential coverage--comprehension…

Descriptors: Reading Comprehension, English (Second Language), Second Language Instruction, Second Language Learning

Preliminary Findings to Support the Internal Consistency and Factor Structure of the Ferrari-Lynch-Vogel Listening Test (FLVLT)

Peer reviewed

Direct link

Ferrari-Bridgers, Franca – International Journal of Listening, 2023

While many tools exist to assess student content knowledge, there are few that assess whether students display the critical listening skills necessary to interpret the quality of a speaker's message at the college level. The following research provides preliminary evidence for the internal consistency and factor structure of a tool, the…

Descriptors: Factor Structure, Test Validity, Community College Students, Test Reliability

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13

Educational and Psychological…	11
ProQuest LLC	9
Online Submission	8
ETS Research Report Series	6
Eurasian Journal of…	5
Educational Research and…	4
Grantee Submission	4
Journal of Educational and…	4
Assessment & Evaluation in…	3
CBE - Life Sciences Education	3
International Journal of…	3
Journal of Education and…	3
Applied Psychological…	2
Educational Sciences: Theory…	2
International Journal of…	2
International Journal of…	2
Journal of Education and…	2
Journal of Educational…	2
Journal of Psychoeducational…	2
Language Testing	2
Research in Developmental…	2
Society for Research on…	2
ACT, Inc.	1
Advances in Physiology…	1
American Journal on Mental…	1
More ▼

Liu, Ou Lydia	5
Farina, Kristy	3
LaVenia, Mark	3
Schoen, Robert C.	3
Attali, Yigal	2
Champagne, Zachary M.	2
Dikmenli, Yurdal	2
Hung Tan Ha	2
Mao, Liyang	2
Metsämuuronen, Jari	2
Sijtsma, Klaas	2
Tim Stoeckel	2
Xu, Jun	2
Zhang, Mo	2
Adamu, Gishua Garba	1
Aedo-Saravia, Jaime	1
Ahmed, Tamim	1
Aktas, Elif	1
Al Khasawneh, Mohanad	1
Aldhalaan, Hesham	1
Aldosari, Mohammed	1
Aliyu, Hassan	1
Allan S. Cohen	1
Almeda, Mia	1
More ▼

SAT (College Admission Test)	6
ACT Assessment	3
Program for International…	2
Raven Progressive Matrices	2
Rosenberg Self Esteem Scale	2
Stanford Achievement Tests	2
Trends in International…	2
ACT Interest Inventory	1
Beck Depression Inventory	1
Behavior Assessment System…	1
Center for Epidemiologic…	1
Clinical Evaluation of…	1
Defining Issues Test	1
Dynamic Indicators of Basic…	1
Early Childhood Longitudinal…	1
Graduate Record Examinations	1
Marlowe Crowne Social…	1
Minnesota Multiphasic…	1
Motivated Strategies for…	1
Peabody Developmental Motor…	1
Peabody Picture Vocabulary…	1
Strengths and Difficulties…	1
Teacher Rating Scale	1
Teaching and Learning…	1
Test of English as a Foreign…	1
More ▼