ERIC - Search Results

Publication Date

In 2025	1
Since 2024	1
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	12

Descriptor

Comparative Analysis	61
Test Reliability	61
Testing	61
Test Validity	34
Test Construction	14
Higher Education	13
Scores	12
Computer Assisted Testing	11
Language Tests	10
Statistical Analysis	10
Testing Problems	9
Foreign Countries	8
Language Proficiency	8
Scoring	8
Achievement Tests	7
College Students	7
English (Second Language)	7
Response Style (Tests)	7
Second Language Learning	7
Evaluation Methods	6
Multiple Choice Tests	6
Test Format	6
Measurement Techniques	5
Psychometrics	5
Test Interpretation	5
More ▼

Publication Type

Reports - Research	33
Journal Articles	27
Speeches/Meeting Papers	9
Reports - Evaluative	5
Reports - Descriptive	4
Books	2
Guides - Non-Classroom	2
Information Analyses	2
Collected Works - General	1
Opinion Papers	1

Education Level

Higher Education	5
Adult Education	1
Early Childhood Education	1
Elementary Secondary Education	1
High Schools	1
Postsecondary Education	1
Preschool Education	1
Secondary Education	1

Audience

Practitioners	2
Teachers	2
Administrators	1

Location

Australia	2
Greece	1
Greece (Athens)	1
Hungary	1
Iran	1
Japan	1
Ohio	1
Turkey	1
Utah	1

Laws, Policies, & Programs

Assessments and Surveys

ACT Assessment	1
ACTFL Oral Proficiency…	1
Beck Anxiety Inventory	1
California Critical Thinking…	1
Center for Epidemiologic…	1
Embedded Figures Test	1
General Aptitude Test Battery	1
Metropolitan Readiness Tests	1
Michigan Test of English…	1
Minnesota Multiphasic…	1
Rod and Frame Test	1
Rosenberg Self Esteem Scale	1
State Trait Anxiety Inventory	1
Strong Campbell Interest…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 61 results Save | Export

Initial Evidence Supporting Interpretations of Scores from the Enhanced ACT Test. ACT Research. Research Report. R2425

Download full text

Jeff Allen; Ty Cruce – ACT Education Corp., 2025

This report summarizes some of the evidence supporting interpretations of scores from the enhanced ACT, focusing on reliability, concurrent validity, predictive validity, and score comparability. The authors argue that the evidence presented in this report supports the interpretation of scores from the enhanced ACT as measures of high school…

Descriptors: College Entrance Examinations, Testing, Change, Scores

Attribute-Level and Pattern-Level Classification Consistency and Accuracy Indices for Cognitive Diagnostic Assessment

Peer reviewed

Direct link

Wang, Wenyi; Song, Lihong; Chen, Ping; Meng, Yaru; Ding, Shuliang – Journal of Educational Measurement, 2015

Classification consistency and accuracy are viewed as important indicators for evaluating the reliability and validity of classification results in cognitive diagnostic assessment (CDA). Pattern-level classification consistency and accuracy indices were introduced by Cui, Gierl, and Chang. However, the indices at the attribute level have not yet…

Descriptors: Classification, Reliability, Accuracy, Cognitive Tests

Online Proctored versus Unproctored Low-Stakes Internet Test Administration: Is There Differential Test-Taking Behavior and Performance?

Peer reviewed

Direct link

Rios, Joseph A.; Liu, Ou Lydia – American Journal of Distance Education, 2017

Online higher education institutions are presented with the concern of how to obtain valid results when administering student learning outcomes (SLO) assessments remotely. Traditionally, there has been a great reliance on unproctored Internet test administration (UIT) due to increased flexibility and reduced costs; however, a number of validity…

Descriptors: Online Courses, Testing, Test Wiseness, Academic Achievement

Computer-Based and Paper-Based Testing: Does the Test Administration Mode Influence the Reliability and Validity of Achievement Tests?

Peer reviewed
PDF on ERIC

Download full text

Öz, Hüseyin; Özturan, Tuba – Journal of Language and Linguistic Studies, 2018

This article reports the findings of a study that sought to investigate whether computer-based vs. paper-based test-delivery mode has an impact on the reliability and validity of an achievement test for a pedagogical content knowledge course in an English teacher education program. A total of 97 university students enrolled in the English as a…

Descriptors: Computer Assisted Testing, Testing, Test Format, Teaching Methods

Computer-Adaptive Assessments: Fundamentals and Considerations

Direct link

Mitchell, Alison M.; Truckenmiller, Adrea; Petscher, Yaacov – Communique, 2015

As part of the Race to the Top initiative, the United States Department of Education made nearly 1 billion dollars available in State Educational Technology grants with the goal of ramping up school technology. One result of this effort is that states, districts, and schools across the country are using computerized assessments to measure their…

Descriptors: Computer Assisted Testing, Educational Technology, Testing, Efficiency

The Estimation of the IRT Reliability Coefficient and Its Lower and Upper Bounds, with Comparisons to CTT Reliability Statistics

Peer reviewed

Direct link

Kim, Seonghoon; Feldt, Leonard S. – Asia Pacific Education Review, 2010

The primary purpose of this study is to investigate the mathematical characteristics of the test reliability coefficient rho[subscript XX'] as a function of item response theory (IRT) parameters and present the lower and upper bounds of the coefficient. Another purpose is to examine relative performances of the IRT reliability statistics and two…

Descriptors: Testing, Test Reliability, Statistics, Item Response Theory

Internet Administration of Paper-and-Pencil Questionnaires Used in Couple Research: Assessing Psychometric Equivalence

Peer reviewed

Direct link

Brock, Rebecca L.; Barry, Robin A.; Lawrence, Erika; Dey, Jodi; Rolffs, Jaci – Assessment, 2012

This study examined the psychometric equivalence of paper-and-pencil and Internet formats of key questionnaires used in couple research. Self-report questionnaires assessing interpersonal constructs (relationship satisfaction, communication/conflict management, partner support, emotional intimacy) and intrapersonal constructs (individual traits,…

Descriptors: Satisfaction, Conflict, Intimacy, Questionnaires

Evaluation of Children's Creativity: Psychometric Properties of Torrance's "Thinking Creatively in Action and Movement" Test

Peer reviewed

Direct link

Zachopoulou, Evridiki; Makri, Anastasia; Pollatou, Elisana – Early Child Development and Care, 2009

The purpose of this study was to examine the test-retest reliability of Torrance's "Thinking Creatively in Action and Movement" (TCAM) test and the relationship between TCAM and the Divergent Movement Ability (DMA) test. The TCAM and DMA tests were used for a sample of 115 children, while the whole experimental procedure included three…

Descriptors: Creativity Tests, Test Reliability, Psychometrics, Preschool Children

Using the Method of Pairwise Comparison to Obtain Reliable Teacher Assessments

Peer reviewed
PDF on ERIC

Download full text

Heldsinger, Sandra; Humphry, Stephen – Australian Educational Researcher, 2010

Demands for accountability have seen the implementation of large scale testing programs in Australia and internationally. There is, however, a growing body of evidence to show that externally imposed testing programs do not have a sustained impact on student achievement. It has been argued that teacher assessment is more effective in raising…

Descriptors: Testing Programs, Testing, Academic Achievement, Measures (Individuals)

Collaboration in Language Testing and Assessment. Language Testing and Evaluation. Volume 26

Direct link

Tsagari, Dina, Ed.; Csepes, Ildiko, Ed. – Peter Lang Frankfurt, 2012

The Guidelines for Good Practice of the European Association for Language Testing and Assessment (EALTA) stress the importance of collaboration between all parties involved in the process of developing instruments, activities and programmes for testing and assessment. Collaboration is considered to be as important as validity and reliability,…

Descriptors: Sign Language, Testing, Language Tests, Test Validity

Assessing Students' Conceptual Understanding after a First Course in Statistics

Peer reviewed

Direct link

delMas, Robert; Garfield, Joan; Ooms, Ann; Chance, Beth – Statistics Education Research Journal, 2007

This paper describes the development of the CAOS test, designed to measure students' conceptual understanding of important statistical ideas, across three years of revision and testing, content validation, and reliability analysis. Results are reported from a large scale class testing and item responses are compared from pretest to posttest in…

Descriptors: Testing, Statistics, Misconceptions, Concept Formation

Study to Compare Reliability of Performance on Live and Recorded Dictation Tests.

Download full text

Manpower Administration (DOL), Washington, DC. U.S. Training and Employment Service. – 1969

To compare the reliability of performance on recorded dictation tests with performance on live tests, 216 university students who were nearing completion of an intermediate shorthand course and 26 job applicants seeking stenographic positions were divided into 10 groups, with five receiving live dictation and five receiving recorded dictation. The…

Descriptors: Comparative Analysis, Comparative Testing, Evaluation, Performance Tests

Sampling of Common Items: An Unrecognized Source of Error in Test Equating. CSE Report 636

Download full text

Michaelides, Michalis P.; Haertel, Edward H. – Center for Research on Evaluation Standards and Student Testing CRESST, 2004

There is variability in the estimation of an equating transformation because common-item parameters are obtained from responses of samples of examinees. The most commonly used standard error of equating quantifies this source of sampling error, which decreases as the sample size of examinees used to derive the transformation increases. In a…

Descriptors: Test Items, Testing, Error Patterns, Interrater Reliability

The Reduced Size Rod and Frame Test as a Measure of Psychological Differentiation

Peer reviewed

Nickel, Ted – Educational and Psychological Measurement, 1971

Directions are provided for the construction of a reduced size Rod and Frame Test. Simpler and less expensive, the proposed apparatus has criterion validity parallel to that of the full-sized. (GS)

Descriptors: Comparative Analysis, Psychological Studies, Sex Differences, Statistical Analysis

Research on the Comparability of the Oral Proficiency Interview and the Simulated Oral Proficiency Interview.

Peer reviewed

Stansfield, Charles W.; Kenyon, Dorry Mann – System, 1992

Reviews research that sheds light on the comparability of Oral Proficiency Interview and the Simulated Oral Proficiency Interview. Suggestions are provided for further research. (16 references) (VWL)

Descriptors: Comparative Analysis, Interviews, Language Proficiency, Language Tests

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5

Journal of Educational…	3
Language Testing	2
ACT Education Corp.	1
American Journal of Distance…	1
Asia Pacific Education Review	1
Assessment	1
Australian Educational…	1
Bulletin of Faculty of…	1
Center for Research on…	1
Communique	1
Computers in Human Behavior	1
Developmental Psychology	1
Early Child Development and…	1
Edinburgh Working Papers in…	1
Educational and Psychological…	1
Illinois School Research and…	1
International Journal of…	1
International Review of…	1
Journal of Experimental…	1
Journal of Language and…	1
Journal of Research and…	1
Journal of Science Education…	1
Journal of Special Education	1
Measurement and Evaluation in…	1
Modern Language Journal	1
More ▼

Weiss, David J.	3
Betz, Nancy E.	2
Hakstian, A. Ralph	2
Kansup, Wanlop	2
Kapes, Jerome T.	2
Adams, R. J.	1
Algina, James	1
Anderson, Paul S.	1
Barry, Robin A.	1
Bennett, Randy Elliot	1
Brock, Rebecca L.	1
Chance, Beth	1
Chase, Clinton I.	1
Chen, Ping	1
Csepes, Ildiko, Ed.	1
Cummings, Oliver W.	1
Day, Gerald F.	1
Dey, Jodi	1
Ding, Shuliang	1
Donlon, Thomas F.	1
Dryden, Russell E.	1
Engelhard, George, Jr.	1
Feldt, Leonard S.	1
Ferrer, Emilio	1
More ▼