Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 5 |
Since 2006 (last 20 years) | 8 |
Descriptor
Statistical Analysis | 60 |
Test Reliability | 60 |
Testing | 60 |
Test Validity | 32 |
Test Construction | 14 |
Test Interpretation | 14 |
Measurement Techniques | 12 |
Academic Achievement | 10 |
Comparative Analysis | 10 |
Scores | 10 |
Student Evaluation | 10 |
More ▼ |
Source
Author
ANDRADE, MANUEL | 1 |
Algina, James | 1 |
Ames, Russell | 1 |
Andrulis, Richard S. | 1 |
Bayazidi, Aso | 1 |
Blatchford, Charles H. | 1 |
Brown, Thomas A. | 1 |
CURTIS, H.A. | 1 |
Chambers, Francine | 1 |
Cohen, Allan S., Comp. | 1 |
Cole, Russell | 1 |
More ▼ |
Publication Type
Education Level
Higher Education | 4 |
Elementary Secondary Education | 2 |
Postsecondary Education | 2 |
Audience
Practitioners | 1 |
Location
United Kingdom (England) | 2 |
Australia | 1 |
California (Stanford) | 1 |
Colorado (Denver) | 1 |
Iran | 1 |
Japan | 1 |
Turkey | 1 |
United Kingdom | 1 |
Laws, Policies, & Programs
Elementary and Secondary… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Practices in Instrument Use and Development in "Chemistry Education Research and Practice" 2010-2021
Lazenby, Katherine; Tenney, Kristin; Marcroft, Tina A.; Komperda, Regis – Chemistry Education Research and Practice, 2023
Assessment instruments that generate quantitative data on attributes (cognitive, affective, behavioral, "etc.") of participants are commonly used in the chemistry education community to draw conclusions in research studies or inform practice. Recently, articles and editorials have stressed the importance of providing evidence for the…
Descriptors: Chemistry, Periodicals, Journal Articles, Science Education
Bayazidi, Aso; Saeb, Fateme – Advances in Language and Literary Studies, 2017
This study examined the equivalence and reliability of the two versions of the Vocabulary Levels Test in an Iranian context. This study was motivated by the fact that the Vocabulary Levels test is increasingly being used in Iran for both research and pedagogical purposes without having been checked for validity and reliability in this context. The…
Descriptors: Foreign Countries, Vocabulary, English (Second Language), College Second Language Programs
Rios, Joseph A.; Liu, Ou Lydia – American Journal of Distance Education, 2017
Online higher education institutions are presented with the concern of how to obtain valid results when administering student learning outcomes (SLO) assessments remotely. Traditionally, there has been a great reliance on unproctored Internet test administration (UIT) due to increased flexibility and reduced costs; however, a number of validity…
Descriptors: Online Courses, Testing, Test Wiseness, Academic Achievement
Öz, Hüseyin; Özturan, Tuba – Journal of Language and Linguistic Studies, 2018
This article reports the findings of a study that sought to investigate whether computer-based vs. paper-based test-delivery mode has an impact on the reliability and validity of an achievement test for a pedagogical content knowledge course in an English teacher education program. A total of 97 university students enrolled in the English as a…
Descriptors: Computer Assisted Testing, Testing, Test Format, Teaching Methods
Powers, Sonya; Li, Dongmei; Suh, Hongwook; Harris, Deborah J. – ACT, Inc., 2016
ACT reporting categories and ACT Readiness Ranges are new features added to the ACT score reports starting in fall 2016. For each reporting category, the number correct score, the maximum points possible, the percent correct, and the ACT Readiness Range, along with an indicator of whether the reporting category score falls within the Readiness…
Descriptors: Scores, Classification, College Entrance Examinations, Error of Measurement
May, Henry; Cole, Russell; Haimson, Josh; Perez-Johnson, Irma – Society for Research on Educational Effectiveness, 2010
The purpose of this study is to provide empirical benchmarks of the conditional reliabilities of state tests for samples of the student population defined by ability level. Given that many educational interventions are targeted for samples of low performing students, schools, or districts, the primary goal of this research is to determine how…
Descriptors: Intervention, Statistical Analysis, Academic Achievement, Test Reliability
Heldsinger, Sandra; Humphry, Stephen – Australian Educational Researcher, 2010
Demands for accountability have seen the implementation of large scale testing programs in Australia and internationally. There is, however, a growing body of evidence to show that externally imposed testing programs do not have a sustained impact on student achievement. It has been argued that teacher assessment is more effective in raising…
Descriptors: Testing Programs, Testing, Academic Achievement, Measures (Individuals)

Subkoviak, Michael J. – Journal of Educational Measurement, 1988
Current methods for obtaining reliability indices for mastery tests can be laborious. This paper offers practitioners tables from which agreement and kappa coefficients can be read directly and provides criterion for acceptable values of agreement and kappa coefficients. (TJH)
Descriptors: Mastery Tests, Statistical Analysis, Test Reliability, Testing

Veitch, William R.; Roscoe, John T. – Journal of Experimental Education, 1974
A Monte Carlo technique was employed in order to compare the relative power and robustness of the Bartlett, Cochran, Hartley, and Levene tests for homogeniety of variance. (Editor)
Descriptors: Research Methodology, Statistical Analysis, Statistical Data, Test Reliability

Kristof, Walter – Psychometrika, 1974
Descriptors: Models, Statistical Analysis, Test Reliability, Testing

Ramsay, J. O. – Educational and Psychological Measurement, 1971
The consequences of the assumption that the expected score is equal to the true score are shown and alternatives discussed. (MS)
Descriptors: Psychological Testing, Statistical Analysis, Test Reliability, Testing
Livingston, Ronald B.; Jennings, Earl; Colotla, Victor A.; Reynolds, Cecil R.; Shercliffe, Regan J. – Psychological Assessment, 2006
In this study, the authors examined the stability of Minnesota Multiphasic Personality Inventory--2 (J. N. Butcher, W. G. Dahlstrom, J. R. Graham, A. Tellegen, & B. Kaemmer, 1989) code types in a sample of 94 injured workers with a mean test-retest interval of 21.3 months (SD = 14.1). Congruence rates for undefined code types were 34% for…
Descriptors: Congruence (Psychology), Injuries, Personality Measures, Test Reliability

Echternacht, Gary – Educational and Psychological Measurement, 1975
Estimates for the variances of empirically determined scoring weights are given. It is also shown that test item writers should write distractors that discriminate on the criterion variable when this type of scoring is used. (Author)
Descriptors: Scoring, Statistical Analysis, Test Construction, Test Reliability

CURTIS, H.A.; KROPP, R.P. – 1961
A STUDY WAS CONDUCTED TO DETERMINE CHANGES IN SCORE CHARACTERISTICS RELATED TO VARIOUS CONDITIONS OF SPEEDEDNESS WHEN A TEST IS PRESENTED VISUALLY BY PROJECTING ONE ITEM AT A TIME. THIS METHOD OF STUDY IS POTENTIALLY USEFUL FOR PRESENTING TEST MATERIALS BY TELEVISION. THE TOTAL SCORE AND PART SCORE DATA REVEALED HIGH RELATIONSHIPS BETWEEN THE…
Descriptors: Instructional Innovation, Projection Equipment, Statistical Analysis, Test Reliability
Stamm, Carol L. – Journal of Physical Education and Recreation, 1976
Use of the coefficient of concordance is explained to be a simple technique for estimating reliability in small scale situation and is advocated as a valuable method for physical educators. (GW)
Descriptors: Measurement Techniques, Nonparametric Statistics, Physical Education, Statistical Analysis