Publication Date
| In 2026 | 0 |
| Since 2025 | 49 |
| Since 2022 (last 5 years) | 211 |
| Since 2017 (last 10 years) | 492 |
| Since 2007 (last 20 years) | 984 |
Descriptor
| Test Validity | 3908 |
| Test Reliability | 1517 |
| Testing | 1090 |
| Test Construction | 1014 |
| Testing Problems | 1008 |
| Computer Assisted Testing | 616 |
| Elementary Secondary Education | 553 |
| Foreign Countries | 494 |
| Higher Education | 490 |
| Standardized Tests | 488 |
| Test Interpretation | 433 |
| More ▼ | |
Source
Author
| Ebel, Robert L. | 16 |
| Hambleton, Ronald K. | 13 |
| Green, Donald Ross | 10 |
| Popham, W. James | 10 |
| Linn, Robert L. | 9 |
| Haney, Walt | 8 |
| Koretz, Daniel | 8 |
| Sireci, Stephen G. | 8 |
| Thompson, Bruce | 8 |
| Tindal, Gerald | 8 |
| Hilliard, Asa G., III | 7 |
| More ▼ | |
Publication Type
Education Level
Audience
| Practitioners | 137 |
| Researchers | 134 |
| Teachers | 51 |
| Administrators | 34 |
| Policymakers | 18 |
| Counselors | 11 |
| Students | 8 |
| Parents | 5 |
| Support Staff | 4 |
| Community | 2 |
Location
| Canada | 57 |
| Australia | 40 |
| California | 40 |
| China | 34 |
| United Kingdom (England) | 31 |
| United Kingdom | 29 |
| New York | 28 |
| United States | 26 |
| Florida | 22 |
| Germany | 21 |
| Turkey | 20 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Peer reviewedGoh, David S. – Journal of Clinical Psychology, 1980
Examined the validity coefficients of all possible WISC-R short forms of several subtests. Comparisons were made between coefficients given by McNemar's and Silverstein's formulas to determine "best" short forms for different uses. Results indicated only a slight difference between short forms selected by the two methods. (Author)
Descriptors: Children, Psychological Testing, Test Construction, Test Validity
Peer reviewedGoodwin, Laura D.; Leech, Nancy L. – Measurement and Evaluation in Counseling and Development, 2003
The treatment of validity in the newest edition of "Standards for Educational and Psychological Testing" is quite different from coverage in earlier editions of the Standards and in most measurement textbooks. The view of validity in the 1999 Standards is discussed, and suggestions for instructors of measurement courses are offered. (Contains 56…
Descriptors: Educational Testing, Evaluation Methods, Psychological Testing, Standards
Bramley, Tom; Gill, Tim – Research Papers in Education, 2010
The rank-ordering method for standard maintaining was designed for the purpose of mapping a known cut-score (e.g. a grade boundary mark) on one test to an equivalent point on the test score scale of another test, using holistic expert judgements about the quality of exemplars of examinees' work (scripts). It is a novel application of an old…
Descriptors: Scores, Psychometrics, Measurement Techniques, Foreign Countries
Noble, Tracy; Suarez, Catherine; Rosebery, Ann; O'Connor, Mary Catherine; Warren, Beth; Hudicourt-Barnes, Josiane – Journal of Research in Science Teaching, 2012
Education policy in the U.S. in the last two decades has emphasized large-scale assessment of students, with growing consequences for schools, teachers, and students. Given the high stakes of such tests, it is important to understand the relationships between students' answers to test items and their knowledge and skills in the tested content…
Descriptors: Testing, Science Tests, Second Language Learning, Measures (Individuals)
Sampson, Demetrios G., Ed.; Ifenthaler, Dirk, Ed.; Isaías, Pedro, Ed. – International Association for Development of the Information Society, 2021
These proceedings contain the papers of the 18th International Conference on Cognition and Exploratory Learning in the Digital Age (CELDA 2021), held virtually, due to an exceptional situation caused by the COVID-19 pandemic, from October 13-15, 2021, and organized by the International Association for Development of the Information Society…
Descriptors: Computer Simulation, Open Educational Resources, Telecommunications, Handheld Devices
Barry, Carol L.; Finney, Sara J. – Research & Practice in Assessment, 2009
The effects of gathering test scores under low-stakes conditions has been a prominent domain of research in the assessment and testing literature. One important area within this larger domain concerns the implications of a test being low-stakes on test evaluation and development. The current study examined one variable, the testing context, that…
Descriptors: Testing, Context Effect, Comparative Analysis, Test Validity
Green, Donald Ross; Draper, John F. – 1972
This paper considers the question of bias in group administered academic achievement tests, bias which is inherent in the instruments themselves. A body of data on the test of performance of three disadvantaged minority groups--northern, urban black; southern, rural black; and, southwestern, Mexican-Americans--as tryout samples in contrast to…
Descriptors: Achievement Tests, Bias, Comparative Testing, Educational Testing
National Inst. of Education (ED), Washington, DC. – 1981
Barbara Jordan served as the hearing officer for three-day adversary evaluation hearings about the pros and cons of minimum competency testing (MCT). This report is the complete transcript of the second day of proceedings. The pro team, lead by James Popham, began by presenting representatives of four states (Florida, California, Texas, and…
Descriptors: Cutting Scores, Elementary Secondary Education, Hearings, Minimum Competency Testing
Leclercq, Dieudonne – Evaluation in Education: An International Review Series, 1982
In a confidence weighting situation, the examinee is asked to indicate the correct answer, and how certain he or she is of the correctness of that answer. This paper reviews the bases for confidence marking, its validity and accuracy in evaluating students, and it's use in research. (BW)
Descriptors: Confidence Testing, Educational Research, Measurement Techniques, Models
Casey, Emmett – New Directions for Community Colleges, 1987
Offers background on issues related to the testing of disabled students. Reports on a survey about testing accommodation for disabled students provided by California community colleges. Recommends additional ways in which testing practices can be modified to meet the needs of disabled students. (DMM)
Descriptors: Accessibility (for Disabled), Community Colleges, Design Requirements, Educational Testing
Peer reviewedCurtis, Connie June; And Others – Educational and Psychological Measurement, 1979
The score distributions of the two methods of administration described in the title revealed comparable means, standard deviations, and general shape of distribution. With respect to validity coefficients, no appreciable differences were found. (JKS)
Descriptors: Comparative Testing, Educational Testing, Eye Hand Coordination, Grade 2
Hirai, Akiyo; Koizumi, Rie – Language Assessment Quarterly, 2009
This article presents a test development project for classroom speaking assessment. With the aim of enhancing and specifically easing the process of test preparation and administration and generating positive washback effects on learning, we developed a semi-direct speaking test called the Story Retelling Speaking Test (SRST). Although a story…
Descriptors: Speech Communication, Language Tests, Test Construction, Story Telling
Mueller, Karsten; Liebig, Christian; Hattrup, Keith – Educational and Psychological Measurement, 2007
Two quasi-experimental field studies were conducted to evaluate the psychometric equivalence of computerized and paper-and-pencil job satisfaction measures. The present research extends previous work in the area by providing better control of common threats to validity in quasi-experimental research on test mode effects and by evaluating a more…
Descriptors: Psychometrics, Field Studies, Job Satisfaction, Computer Assisted Testing
Allen, Nancy L.; And Others – 1992
Many testing programs include a section of optional questions in addition to mandatory parts of a test. These optional parts of a test are not often truly parallel to one another, and groups of examinees selecting each optional test section are not equivalent to one another. This paper provides a general method based on missing-data methods for…
Descriptors: Comparative Testing, Estimation (Mathematics), Graphs, Scaling
Moreno, Kathleen E.; And Others – 1983
The relationship between selected subtests from the Armed Services Vocational Aptitude Battery (ASVAB) and corresponding subtests administered as computerized adaptive tests (CAT) was investigated using a sample of Marine recruits. Results showed that the CAT subtest scores correlated as well with initial ASVAB scores as did ASVAB retest scores,…
Descriptors: Adaptive Testing, Aptitude Tests, Computer Assisted Testing, Correlation

Direct link
