Showing 1 to 15 of 60 results
Peer reviewed
Wang, Wenyi; Song, Lihong; Chen, Ping; Meng, Yaru; Ding, Shuliang – Journal of Educational Measurement, 2015
Classification consistency and accuracy are viewed as important indicators for evaluating the reliability and validity of classification results in cognitive diagnostic assessment (CDA). Pattern-level classification consistency and accuracy indices were introduced by Cui, Gierl, and Chang. However, the indices at the attribute level have not yet…
Descriptors: Classification, Reliability, Accuracy, Cognitive Tests
Peer reviewed
Rios, Joseph A.; Liu, Ou Lydia – American Journal of Distance Education, 2017
Online higher education institutions are presented with the concern of how to obtain valid results when administering student learning outcomes (SLO) assessments remotely. Traditionally, there has been a great reliance on unproctored Internet test administration (UIT) due to increased flexibility and reduced costs; however, a number of validity…
Descriptors: Online Courses, Testing, Test Wiseness, Academic Achievement
Peer reviewed
Öz, Hüseyin; Özturan, Tuba – Journal of Language and Linguistic Studies, 2018
This article reports the findings of a study that sought to investigate whether computer-based vs. paper-based test-delivery mode has an impact on the reliability and validity of an achievement test for a pedagogical content knowledge course in an English teacher education program. A total of 97 university students enrolled in the English as a…
Descriptors: Computer Assisted Testing, Testing, Test Format, Teaching Methods
Mitchell, Alison M.; Truckenmiller, Adrea; Petscher, Yaacov – Communique, 2015
As part of the Race to the Top initiative, the United States Department of Education made nearly 1 billion dollars available in State Educational Technology grants with the goal of ramping up school technology. One result of this effort is that states, districts, and schools across the country are using computerized assessments to measure their…
Descriptors: Computer Assisted Testing, Educational Technology, Testing, Efficiency
Peer reviewed
Kim, Seonghoon; Feldt, Leonard S. – Asia Pacific Education Review, 2010
The primary purpose of this study is to investigate the mathematical characteristics of the test reliability coefficient $\rho_{XX'}$ as a function of item response theory (IRT) parameters and present the lower and upper bounds of the coefficient. Another purpose is to examine relative performances of the IRT reliability statistics and two…
Descriptors: Testing, Test Reliability, Statistics, Item Response Theory
Peer reviewed
Brock, Rebecca L.; Barry, Robin A.; Lawrence, Erika; Dey, Jodi; Rolffs, Jaci – Assessment, 2012
This study examined the psychometric equivalence of paper-and-pencil and Internet formats of key questionnaires used in couple research. Self-report questionnaires assessing interpersonal constructs (relationship satisfaction, communication/conflict management, partner support, emotional intimacy) and intrapersonal constructs (individual traits,…
Descriptors: Satisfaction, Conflict, Intimacy, Questionnaires
Peer reviewed
Zachopoulou, Evridiki; Makri, Anastasia; Pollatou, Elisana – Early Child Development and Care, 2009
The purpose of this study was to examine the test-retest reliability of Torrance's "Thinking Creatively in Action and Movement" (TCAM) test and the relationship between TCAM and the Divergent Movement Ability (DMA) test. The TCAM and DMA tests were used for a sample of 115 children, while the whole experimental procedure included three…
Descriptors: Creativity Tests, Test Reliability, Psychometrics, Preschool Children
Peer reviewed
Heldsinger, Sandra; Humphry, Stephen – Australian Educational Researcher, 2010
Demands for accountability have seen the implementation of large-scale testing programs in Australia and internationally. There is, however, a growing body of evidence to show that externally imposed testing programs do not have a sustained impact on student achievement. It has been argued that teacher assessment is more effective in raising…
Descriptors: Testing Programs, Testing, Academic Achievement, Measures (Individuals)
Tsagari, Dina, Ed.; Csepes, Ildiko, Ed. – Peter Lang Frankfurt, 2012
The Guidelines for Good Practice of the European Association for Language Testing and Assessment (EALTA) stress the importance of collaboration between all parties involved in the process of developing instruments, activities and programmes for testing and assessment. Collaboration is considered to be as important as validity and reliability,…
Descriptors: Sign Language, Testing, Language Tests, Test Validity
Peer reviewed
delMas, Robert; Garfield, Joan; Ooms, Ann; Chance, Beth – Statistics Education Research Journal, 2007
This paper describes the development of the CAOS test, designed to measure students' conceptual understanding of important statistical ideas, across three years of revision and testing, content validation, and reliability analysis. Results are reported from large-scale class testing, and item responses are compared from pretest to posttest in…
Descriptors: Testing, Statistics, Misconceptions, Concept Formation
Manpower Administration (DOL), Washington, DC. U.S. Training and Employment Service. – 1969
To compare the reliability of performance on recorded dictation tests with performance on live tests, 216 university students who were nearing completion of an intermediate shorthand course and 26 job applicants seeking stenographic positions were divided into 10 groups, with five receiving live dictation and five receiving recorded dictation. The…
Descriptors: Comparative Analysis, Comparative Testing, Evaluation, Performance Tests
Michaelides, Michalis P.; Haertel, Edward H. – Center for Research on Evaluation Standards and Student Testing CRESST, 2004
There is variability in the estimation of an equating transformation because common-item parameters are obtained from responses of samples of examinees. The most commonly used standard error of equating quantifies this source of sampling error, which decreases as the sample size of examinees used to derive the transformation increases. In a…
Descriptors: Test Items, Testing, Error Patterns, Interrater Reliability
Peer reviewed
Nickel, Ted – Educational and Psychological Measurement, 1971
Directions are provided for the construction of a reduced-size Rod and Frame Test. Simpler and less expensive, the proposed apparatus has criterion validity parallel to that of the full-sized version. (GS)
Descriptors: Comparative Analysis, Psychological Studies, Sex Differences, Statistical Analysis
Peer reviewed
Stansfield, Charles W.; Kenyon, Dorry Mann – System, 1992
Reviews research that sheds light on the comparability of the Oral Proficiency Interview and the Simulated Oral Proficiency Interview. Suggestions are provided for further research. (16 references) (VWL)
Descriptors: Comparative Analysis, Interviews, Language Proficiency, Language Tests
Peer reviewed
Howell, Scott L. – New Directions for Teaching and Learning, 2004
Although instructional methods are moving in ever greater numbers to a multimedia base, testing is not. What principles should be considered in correcting this misalignment?
Descriptors: Multimedia Instruction, Teaching Methods, Test Validity, Test Reliability