ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	8
Since 2006 (last 20 years)	13

Descriptor

Comparative Analysis	25
Test Reliability	25
Simulation	19
Correlation	8
Computer Simulation	6
Computer Assisted Testing	5
Error of Measurement	5
Item Response Theory	5
Mathematical Models	5
Scores	5
Test Validity	5
Probability	4
Statistical Analysis	4
Test Format	4
Test Items	4
Test Length	4
Accuracy	3
Classification	3
College Students	3
Computation	3
Difficulty Level	3
Higher Education	3
Individual Differences	3
Item Banks	3
Language Tests	3
More ▼

Source

Journal of Educational…	3
Educational Measurement:…	2
Psychometrika	2
ETS Research Report Series	1
Education and Information…	1
Educational and Psychological…	1
IEEE Transactions on Learning…	1
International Association for…	1
Journal of Educational Issues	1
Journal of Speech, Language,…	1
Journal of Technology and…	1
Review of Higher Education	1
More ▼

Publication Type

Reports - Research	19
Journal Articles	14
Speeches/Meeting Papers	4
Reports - Evaluative	3
Collected Works - Proceedings	1
Reports - Descriptive	1

Education Level

Higher Education	3
Postsecondary Education	2
Elementary Secondary Education	1
Secondary Education	1

Audience

Location

Asia	1
Australia	1
Brazil	1
Connecticut	1
Denmark	1
Egypt	1
Estonia	1
Florida	1
Germany	1
Greece	1
Hawaii	1
Ireland	1
Israel	1
Italy	1
Japan	1
Kazakhstan	1
Netherlands	1
Norway	1
Ohio	1
Pakistan	1
Pennsylvania	1
Philippines	1
Portugal	1
Singapore	1
South Korea	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

National Survey of Student…

What Works Clearinghouse Rating

Showing 1 to 15 of 25 results Save | Export

Short-Term Test-Retest Reliability of Contralateral Suppression of Click-Evoked Otoacoustic Emissions in Normal-Hearing Subjects

Peer reviewed

Direct link

Keppler, Hannah; Degeest, Sofie; Vinck, Bart – Journal of Speech, Language, and Hearing Research, 2021

Purpose: The objective of the current study was to investigate the short-term test-retest reliability of contralateral suppression (CS) of click-evoked otoacoustic emissions (CEOAEs) using commercially available otoacoustic emission equipment. Method: Twenty-three young normal-hearing subjects were tested. An otoscopic evaluation, admittance…

Descriptors: Test Reliability, Hearing (Physiology), Acoustics, Auditory Tests

Integration of Interactive Computer Simulations in Teaching and Learning Chemical Reaction: Students' Performance and Concept Retention

Peer reviewed
PDF on ERIC

Download full text

Jane Batamuliza; Gonzague Habinshuti; Jean Baptiste Nkurunziza – Journal of Technology and Science Education, 2024

This current study presents the effects of interactive computer simulations on students' performance and concept retention in the unit of chemical reactions. Purposive sampling was used to select four schools with a sample population of 320. The Achievement test on chemical reactions was developed, validated, and checked for reliability. The…

Descriptors: Chemistry, Science Instruction, Teaching Methods, Comparative Analysis

Measuring Language Ability of Students with Compensatory Multidimensional CAT: A Post-Hoc Simulation Study

Peer reviewed

Direct link

Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022

The computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. The multidimensional CAT (MCAT) designs differ in terms of different item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…

Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency

Systematic Comparison of Decision Accuracy of Complex Compensatory Decision Rules Combining Multiple Tests in a Higher Education Context

Peer reviewed

Direct link

Yocarini, Iris E.; Bouwmeester, Samantha; Smeets, Guus; Arends, Lidia R. – Educational Measurement: Issues and Practice, 2018

This real-data-guided simulation study systematically evaluated the decision accuracy of complex decision rules combining multiple tests within different realistic curricula. Specifically, complex decision rules combining conjunctive aspects and compensatory aspects were evaluated. A conjunctive aspect requires a minimum level of performance,…

Descriptors: Comparative Analysis, Decision Making, Accuracy, Higher Education

Reliably Assessing Growth with Longitudinal Diagnostic Classification Models

Peer reviewed

Direct link

Madison, Matthew J. – Educational Measurement: Issues and Practice, 2019

Recent advances have enabled diagnostic classification models (DCMs) to accommodate longitudinal data. These longitudinal DCMs were developed to study how examinees change, or transition, between different attribute mastery statuses over time. This study examines using longitudinal DCMs as an approach to assessing growth and serves three purposes:…

Descriptors: Longitudinal Studies, Item Response Theory, Psychometrics, Criterion Referenced Tests

How Important Are High Response Rates for College Surveys?

Peer reviewed

Direct link

Fosnacht, Kevin; Sarraf, Shimon; Howe, Elijah; Peck, Leah K. – Review of Higher Education, 2017

Surveys play an important role in understanding the higher education landscape. About 60 percent of the published research in major higher education journals utilized survey data (Pike, 2007). Institutions also commonly use surveys to assess student outcomes and evaluate programs, instructors, and even cafeteria food. However, declining survey…

Descriptors: Higher Education, Surveys, Response Rates (Questionnaires), Simulation

Attribute-Level and Pattern-Level Classification Consistency and Accuracy Indices for Cognitive Diagnostic Assessment

Peer reviewed

Direct link

Wang, Wenyi; Song, Lihong; Chen, Ping; Meng, Yaru; Ding, Shuliang – Journal of Educational Measurement, 2015

Classification consistency and accuracy are viewed as important indicators for evaluating the reliability and validity of classification results in cognitive diagnostic assessment (CDA). Pattern-level classification consistency and accuracy indices were introduced by Cui, Gierl, and Chang. However, the indices at the attribute level have not yet…

Descriptors: Classification, Reliability, Accuracy, Cognitive Tests

DIF Analysis with Multilevel Data: A Simulation Study Using the Latent Variable Approach

Peer reviewed
PDF on ERIC

Download full text

Jin, Ying; Eason, Hershel – Journal of Educational Issues, 2016

The effects of mean ability difference (MAD) and short tests on the performance of various DIF methods have been studied extensively in previous simulation studies. Their effects, however, have not been studied under multilevel data structure. MAD was frequently observed in large-scale cross-country comparison studies where the primary sampling…

Descriptors: Test Bias, Simulation, Hierarchical Linear Modeling, Comparative Analysis

Item Response Theory for Peer Assessment

Peer reviewed

Direct link

Uto, Masaki; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2016

As an assessment method based on a constructivist approach, peer assessment has become popular in recent years. However, in peer assessment, a problem remains that reliability depends on the rater characteristics. For this reason, some item response models that incorporate rater parameters have been proposed. Those models are expected to improve…

Descriptors: Item Response Theory, Peer Evaluation, Bayesian Statistics, Simulation

A Comparison of Different Psychometric Approaches to Modeling Testlet Structures: An Example with C-Tests

Peer reviewed

Direct link

Schroeders, Ulrich; Robitzsch, Alexander; Schipolowski, Stefan – Journal of Educational Measurement, 2014

C-tests are a specific variant of cloze tests that are considered time-efficient, valid indicators of general language proficiency. They are commonly analyzed with models of item response theory assuming local item independence. In this article we estimated local interdependencies for 12 C-tests and compared the changes in item difficulties,…

Descriptors: Comparative Analysis, Psychometrics, Cloze Procedure, Language Tests

Multidimensional CAT Item Selection Methods for Domain Scores and Composite Scores: Theory and Applications

Peer reviewed

Direct link

Yao, Lihua – Psychometrika, 2012

Multidimensional computer adaptive testing (MCAT) can provide higher precision and reliability or reduce test length when compared with unidimensional CAT or with the paper-and-pencil test. This study compared five item selection procedures in the MCAT framework for both domain scores and overall scores through simulation by varying the structure…

Descriptors: Item Banks, Test Length, Simulation, Adaptive Testing

Comparison of Multistage Tests with Computerized Adaptive and Paper-and-Pencil Tests. Research Report. ETS RR-07-04

Peer reviewed
PDF on ERIC

Download full text

Rotou, Ourania; Patsula, Liane; Steffen, Manfred; Rizavi, Saba – ETS Research Report Series, 2007

Traditionally, the fixed-length linear paper-and-pencil (P&P) mode of administration has been the standard method of test delivery. With the advancement of technology, however, the popularity of administering tests using adaptive methods like computerized adaptive testing (CAT) and multistage testing (MST) has grown in the field of measurement…

Descriptors: Comparative Analysis, Test Format, Computer Assisted Testing, Models

Robinson's Measure of Agreement as a Parallel Forms Reliability Coefficient.

Download full text

Willson, Victor L. – 1977

A major deficiency in classical test theory is the reliance on Pearson product-moment (PPM) correlation concepts in the definition of reliability. PPM measures are totally insensitive to first moment differences in tests which leads to the dubious assumption of essential tan-equivalence. Robinson proposed a measure of agreement that is sensitive…

Descriptors: Comparative Analysis, Correlation, Difficulty Level, Mathematical Formulas

A K-Sample Significance Test for Independent Alpha Coefficients

Peer reviewed

Hakstian, A. Ralph; Whalen, Thomas E. – Psychometrika, 1976

Details of a reasonably precise normalization technique for coefficient alpha are outlined, along with methods for estimating the variance of the normalized statistic. These procedures lead to the K-sample significance test. (RC)

Descriptors: Analysis of Variance, Comparative Analysis, Error Patterns, Hypothesis Testing

A Comparative Analysis of Simulated and Direct Oral Proficiency Interviews.

Download full text

Stansfield, Charles W. – 1990

The simulated oral proficiency interview (SOPI) is a semi-direct speaking test that models the format of the oral proficiency interview (OPI). The OPI is a method of assessing general speaking proficiency in a second language. The SOPI is a tape-recorded test consisting of six parts: simple personal background questions posed in a simulated…

Descriptors: Comparative Analysis, Interviews, Language Proficiency, Language Tests

Previous Page | Next Page »

Pages: 1 | 2

Arends, Lidia R.	1
Betz, Nancy E.	1
Bouwmeester, Samantha	1
Brown, R. L.	1
Chen, Ping	1
Degeest, Sofie	1
Ding, Shuliang	1
Eason, Hershel	1
Eignor, Daniel R.	1
Fosnacht, Kevin	1
Frary, Robert B.	1
Gelbal, Selahattin	1
Gonzague Habinshuti	1
Hakstian, A. Ralph	1
Hambleton, Ronald K.	1
Howe, Elijah	1
Jane Batamuliza	1
Jean Baptiste Nkurunziza	1
Jin, Ying	1
Keppler, Hannah	1
Kocher, A. Thel	1
Madison, Matthew J.	1
Mandeville, Garrett K.	1
Marston, Paul T., Borich,…	1
More ▼