ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	5
Since 2006 (last 20 years)	9

Descriptor

Correlation	16
Simulation	16
Test Reliability	16
Comparative Analysis	5
Evaluation Methods	4
Item Response Theory	4
Test Items	4
Computation	3
Measurement Techniques	3
Problem Solving	3
Scores	3
Testing	3
College Students	2
Computer Assisted Testing	2
Difficulty Level	2
Effect Size	2
Higher Education	2
Pretests Posttests	2
Probability	2
Response Style (Tests)	2
Responses	2
Sample Size	2
Scoring	2
Statistical Bias	2
Test Results	2
More ▼

Source

Journal of Educational…	2
Advances in Health Sciences…	1
European Journal of…	1
Journal of Consulting and…	1
Journal of Educational and…	1
Journal of Speech, Language,…	1
Measurement:…	1
National Center for Education…	1
ProQuest LLC	1
Psychometrika	1
Review of Higher Education	1
Society for Research on…	1
More ▼

Publication Type

Reports - Research	12
Journal Articles	10
Reports - Descriptive	2
Dissertations/Theses -…	1
Numerical/Quantitative Data	1
Reports - Evaluative	1

Education Level

Postsecondary Education	3
Higher Education	2
Adult Education	1
Elementary Secondary Education	1
Grade 8	1

Audience

Location

Russia

Laws, Policies, & Programs

Assessments and Surveys

National Survey of Student…

What Works Clearinghouse Rating

Showing 1 to 15 of 16 results Save | Export

Estimating Difference-Score Reliability in Pretest-Posttest Settings

Peer reviewed

Direct link

Gu, Zhengguo; Emons, Wilco H. M.; Sijtsma, Klaas – Journal of Educational and Behavioral Statistics, 2021

Clinical, medical, and health psychologists use difference scores obtained from pretest--posttest designs employing the same test to assess intraindividual change possibly caused by an intervention addressing, for example, anxiety, depression, eating disorder, or addiction. Reliability of difference scores is important for interpreting observed…

Descriptors: Test Reliability, Scores, Pretests Posttests, Computation

Short-Term Test-Retest Reliability of Contralateral Suppression of Click-Evoked Otoacoustic Emissions in Normal-Hearing Subjects

Peer reviewed

Direct link

Keppler, Hannah; Degeest, Sofie; Vinck, Bart – Journal of Speech, Language, and Hearing Research, 2021

Purpose: The objective of the current study was to investigate the short-term test-retest reliability of contralateral suppression (CS) of click-evoked otoacoustic emissions (CEOAEs) using commercially available otoacoustic emission equipment. Method: Twenty-three young normal-hearing subjects were tested. An otoscopic evaluation, admittance…

Descriptors: Test Reliability, Hearing (Physiology), Acoustics, Auditory Tests

The Impact of Aberrant Response on Reliability and Validity

Peer reviewed

Direct link

Liu, Tour; Sun, Yicong; Li, Zhen; Xin, Tao – Measurement: Interdisciplinary Research and Perspectives, 2019

Aberrant response has an important impact on item parameter estimation, individuals' evaluation, and other statistical analysis. There are various types of aberrant response behaviors in educational and psychological tests, like sleeping, guessing, and plodding. Random response is the most common one. The purpose of this research was to clarify…

Descriptors: Test Reliability, Test Validity, Item Response Theory, Differences

How Important Are High Response Rates for College Surveys?

Peer reviewed

Direct link

Fosnacht, Kevin; Sarraf, Shimon; Howe, Elijah; Peck, Leah K. – Review of Higher Education, 2017

Surveys play an important role in understanding the higher education landscape. About 60 percent of the published research in major higher education journals utilized survey data (Pike, 2007). Institutions also commonly use surveys to assess student outcomes and evaluate programs, instructors, and even cafeteria food. However, declining survey…

Descriptors: Higher Education, Surveys, Response Rates (Questionnaires), Simulation

Testing Methodology in the Student Learning Process

Peer reviewed
PDF on ERIC

Download full text

Gorbunova, Tatiana N. – European Journal of Contemporary Education, 2017

The subject of the research is to build methodologies to evaluate the student knowledge by testing. The author points to the importance of feedback about the mastering level in the learning process. Testing is considered as a tool. The object of the study is to create the test system models for defence practice problems. Special attention is paid…

Descriptors: Testing, Evaluation Methods, Feedback (Response), Simulation

A Comparison of Different Psychometric Approaches to Modeling Testlet Structures: An Example with C-Tests

Peer reviewed

Direct link

Schroeders, Ulrich; Robitzsch, Alexander; Schipolowski, Stefan – Journal of Educational Measurement, 2014

C-tests are a specific variant of cloze tests that are considered time-efficient, valid indicators of general language proficiency. They are commonly analyzed with models of item response theory assuming local item independence. In this article we estimated local interdependencies for 12 C-tests and compared the changes in item difficulties,…

Descriptors: Comparative Analysis, Psychometrics, Cloze Procedure, Language Tests

Effect of Violating Unidimensional Item Response Theory Vertical Scaling Assumptions on Developmental Score Scales

Direct link

Topczewski, Anna Marie – ProQuest LLC, 2013

Developmental score scales represent the performance of students along a continuum, where as students learn more they move higher along that continuum. Unidimensional item response theory (UIRT) vertical scaling has become a commonly used method to create developmental score scales. Research has shown that UIRT vertical scaling methods can be…

Descriptors: Item Response Theory, Scaling, Scores, Student Development

Assessing the Conditional Reliability of State Assessments

Download full text

May, Henry; Cole, Russell; Haimson, Josh; Perez-Johnson, Irma – Society for Research on Educational Effectiveness, 2010

The purpose of this study is to provide empirical benchmarks of the conditional reliabilities of state tests for samples of the student population defined by ability level. Given that many educational interventions are targeted for samples of low performing students, schools, or districts, the primary goal of this research is to determine how…

Descriptors: Intervention, Statistical Analysis, Academic Achievement, Test Reliability

Robinson's Measure of Agreement as a Parallel Forms Reliability Coefficient.

Download full text

Willson, Victor L. – 1977

A major deficiency in classical test theory is the reliance on Pearson product-moment (PPM) correlation concepts in the definition of reliability. PPM measures are totally insensitive to first moment differences in tests which leads to the dubious assumption of essential tan-equivalence. Robinson proposed a measure of agreement that is sensitive…

Descriptors: Comparative Analysis, Correlation, Difficulty Level, Mathematical Formulas

Individual Assessment Accuracy.

Peer reviewed

Rudner, Lawrence M. – Journal of Educational Measurement, 1983

Nine indices for assessing the accuracy of an individual's test score were evaluated using simulated item responses to a commercial and a classroom test. The indices appear capable of identifying relatively high proportions of examinees with spurious total scores. (Author/PN)

Descriptors: Correlation, Item Analysis, Latent Trait Theory, Measurement Techniques

Exact Distributions of Intraclass Correlation and Cronbach's Alpha with Gaussian Data and General Covariance

Peer reviewed

Direct link

Kistner, Emily O.; Muller, Keith E. – Psychometrika, 2004

Intraclass correlation and Cronbach's alpha are widely used to describe reliability of tests and measurements. Even with Gaussian data, exact distributions are known only for compound symmetric covariance (equal variances and equal correlations). Recently, large sample Gaussian approximations were derived for the distribution functions. New exact…

Descriptors: Correlation, Test Reliability, Test Results, Probability

Assessing Clinical Significance: Does it Matter which Method we Use?

Peer reviewed

Direct link

Atkins, David C.; Bedics, Jamie D.; Mcglinchey, Joseph B.; Beauchaine, Theodore P. – Journal of Consulting and Clinical Psychology, 2005

Measures of clinical significance are frequently used to evaluate client change during therapy. Several alternatives to the original method devised by N. S. Jacobson, W. C. Follette, & D. Revenstorf (1984) have been proposed, each purporting to increase accuracy. However, researchers have had little systematic guidance in choosing among…

Descriptors: Psychotherapy, Statistical Significance, Outcomes of Treatment, Behavior Change

Problem Solving in Technology-Rich Environments. A Report from the NAEP Technology-Based Assessment Project, Research and Development Series. NCES 2007-466

Peer reviewed
PDF on ERIC

Download full text

Bennett, Randy Elliot; Persky, Hilary; Weiss, Andrew R.; Jenkins, Frank – National Center for Education Statistics, 2007

The Problem Solving in Technology-Rich Environments (TRE) study was designed to demonstrate and explore innovative use of computers for developing, administering, scoring, and analyzing the results of National Assessment of Educational Progress (NAEP) assessments. Two scenarios (Search and Simulation) were created for measuring problem solving…

Descriptors: Computer Assisted Testing, National Competency Tests, Problem Solving, Simulation

A Nonparametric Procedure for Demonstrating a Non-Chance Fit Among Pairs of Multivariate Responses.

Download full text

Mandeville, Garrett K.; And Others – 1975

A strategy for comparing two sets of results (one based upon early childhood recollections (ECR) and another upon video taped (VT) group behavior) from the Perceptual Characteristics Rating Scale was developed. The null distribution of the mean deviation was estimated by randomly matching an ECR response vector with a VT response vector. To…

Descriptors: Comparative Analysis, Correlation, Data Analysis, Goodness of Fit

The Role of a Computerised Case-Based Testing Procedure in Practice Performance Assessment

Peer reviewed

Direct link

Schuwirth, L.; Gorter, S.; Van der Heijde, D.; Rethans, J. J.; Brauer, J.; Houben, H.; Van der Linden, S.; Van der Vleuten, C.; Scherpbier, A. – Advances in Health Sciences Education, 2005

Introduction: For postgraduate training of doctors there is a need for valid and reliable instruments to assess their daily performance. Various instruments have been suggested, some of which use incognito simulated patients (SPs). These methods are resource intensive. Computerised Case-based testing (CCT) is logistically simpler and may still…

Descriptors: Check Lists, Performance Based Assessment, Testing, Predictive Validity

Previous Page | Next Page »

Pages: 1 | 2

Atkins, David C.	1
Beauchaine, Theodore P.	1
Bedics, Jamie D.	1
Bennett, Randy Elliot	1
Brauer, J.	1
Cole, Russell	1
Degeest, Sofie	1
Emons, Wilco H. M.	1
Fosnacht, Kevin	1
Frederiksen, Norman	1
Gorbunova, Tatiana N.	1
Gorter, S.	1
Gu, Zhengguo	1
Haimson, Josh	1
Houben, H.	1
Howe, Elijah	1
Jenkins, Frank	1
Keppler, Hannah	1
Kistner, Emily O.	1
Li, Zhen	1
Liu, Tour	1
Mandeville, Garrett K.	1
May, Henry	1
Mcglinchey, Joseph B.	1
More ▼