Publication Date
| In 2026 | 0 |
| Since 2025 | 49 |
| Since 2022 (last 5 years) | 211 |
| Since 2017 (last 10 years) | 492 |
| Since 2007 (last 20 years) | 984 |
Descriptor
| Test Validity | 3908 |
| Test Reliability | 1517 |
| Testing | 1090 |
| Test Construction | 1014 |
| Testing Problems | 1008 |
| Computer Assisted Testing | 616 |
| Elementary Secondary Education | 553 |
| Foreign Countries | 494 |
| Higher Education | 490 |
| Standardized Tests | 488 |
| Test Interpretation | 433 |
Author
| Ebel, Robert L. | 16 |
| Hambleton, Ronald K. | 13 |
| Green, Donald Ross | 10 |
| Popham, W. James | 10 |
| Linn, Robert L. | 9 |
| Haney, Walt | 8 |
| Koretz, Daniel | 8 |
| Sireci, Stephen G. | 8 |
| Thompson, Bruce | 8 |
| Tindal, Gerald | 8 |
| Hilliard, Asa G., III | 7 |
Audience
| Practitioners | 137 |
| Researchers | 134 |
| Teachers | 51 |
| Administrators | 34 |
| Policymakers | 18 |
| Counselors | 11 |
| Students | 8 |
| Parents | 5 |
| Support Staff | 4 |
| Community | 2 |
Location
| Canada | 57 |
| Australia | 40 |
| California | 40 |
| China | 34 |
| United Kingdom (England) | 31 |
| United Kingdom | 29 |
| New York | 28 |
| United States | 26 |
| Florida | 22 |
| Germany | 21 |
| Turkey | 20 |
Ho, Andrew – Measurement: Interdisciplinary Research and Perspectives, 2013
In his thoughtful focus article, Haertel (this issue) pushes testing experts to broaden the scope of their validation efforts and to invite scholars from other disciplines to join them. He credits existing validation frameworks for helping the measurement community to identify incomplete or nonexistent validity arguments. However, he notes his…
Descriptors: Educational Testing, Scores, Test Use, Test Validity
Haertel, Edward – Measurement: Interdisciplinary Research and Perspectives, 2013
The author is deeply gratified by the commentators' thoughtful responses and finds almost nothing to disagree with in any of them. Each offers additional insights prompting further reflection. In drawing out just a few common themes, this brief rejoinder omits many important ideas from the individual contributions. As stated in his title, the…
Descriptors: Educational Testing, Educational Improvement, Test Interpretation, Test Use
Doskey, Elena M.; Lagunas, Brenda; SooHoo, Michelle; Lomax, Amanda; Bullick, Stephanie – Journal of Psychoeducational Assessment, 2013
The Speed DIAL-4 was developed from the Developmental Indicators for the Assessment of Learning, Fourth Edition (DIAL-4), a screening designed to identify children between the ages of 2 years, 6 months through 5 years, 11 months "who are in need of intervention or diagnostic assessment in the following areas: motor, concepts, language,…
Descriptors: Screening Tests, Young Children, Test Length, Scoring
Behizadeh, Nadia; Engelhard, George, Jr. – Measurement: Interdisciplinary Research and Perspectives, 2015
In his focus article, Koretz (this issue) argues that accountability has become the primary function of large-scale testing in the United States. He then points out that tests being used for accountability purposes are flawed and that the high-stakes nature of these tests creates a context that encourages score inflation. Koretz is concerned about…
Descriptors: Communities of Practice, High Stakes Tests, Testing, Test Validity
Razi, Salim – SAGE Open, 2015
Similarity reports of plagiarism detectors should be approached with caution as they may not be sufficient to support allegations of plagiarism. This study developed a 50-item rubric to simplify and standardize evaluation of academic papers. In the spring semester of the 2011-2012 academic year, 161 freshmen's papers at the English Language Teaching…
Descriptors: Foreign Countries, Scoring Rubrics, Writing Evaluation, Writing (Composition)
Dean, Shannon R. – Journal of Student Affairs Research and Practice, 2017
Developing multiculturally competent citizens is at the forefront of the espoused mission of higher education. The purpose of this study was to develop and validate a self-report instrument to measure traditional-age (18- to 24-year-old) college students' multicultural consciousness (e.g., awareness of self, knowledge of difference, and…
Descriptors: Multicultural Education, Cultural Literacy, Cultural Awareness, Interpersonal Competence
Hauser, Peter C.; Paludneviciene, Raylene; Riddle, Wanda; Kurz, Kim B.; Emmorey, Karen; Contreras, Jessica – Journal of Deaf Studies and Deaf Education, 2016
The American Sign Language Comprehension Test (ASL-CT) is a 30-item multiple-choice test that measures ASL receptive skills and is administered through a website. This article describes the development and psychometric properties of the test based on a sample of 80 college students including deaf native signers, hearing native signers, deaf…
Descriptors: American Sign Language, Comprehension, Multiple Choice Tests, Receptive Language
McGill, Ryan J.; Styck, Kara M.; Palomares, Ronald S.; Hass, Michael R. – Learning Disability Quarterly, 2016
As a result of the upcoming Federal reauthorization of the Individuals With Disabilities Education Improvement Act (IDEA), practitioners and researchers have begun vigorously debating what constitutes evidence-based assessment for the identification of specific learning disability (SLD). This debate has resulted in strong support for a method that…
Descriptors: Learning Disabilities, Disability Identification, Disabilities, Federal Legislation
Dynia, Jaclyn M.; Schachter, Rachel E.; Piasta, Shayne B.; Justice, Laura M.; O'Connell, Ann A.; Yeager Pelatti, Christina – Grantee Submission, 2016
This study investigated the dimensionality of the physical literacy environment of early childhood education classrooms. Data on the classroom physical literacy environment were collected from 245 classrooms using the Classroom Literacy Observation Profile. A combination of confirmatory and exploratory factor analysis was used to identify five…
Descriptors: Early Childhood Education, Classroom Environment, Literacy Education, Factor Analysis
Lane, Suzanne – Measurement: Interdisciplinary Research and Perspectives, 2012
Considering consequences in the evaluation of validity is not new although it is still debated by Paul E. Newton and others. The argument-based approach to validity entails an interpretative argument that explicitly identifies the proposed interpretations and uses of test scores and a validity argument that provides a structure for evaluating the…
Descriptors: Educational Opportunities, Accountability, Validity, Inferences
Quaid, Ethan Douglas – International Journal of Computer-Assisted Language Learning and Teaching, 2018
The present trend in developing and using semi-direct speaking tests has been supported by test developers and researchers' claim of their increased practicality, higher reliability and concurrent validity with test scores in direct oral proficiency interviews. However, it is universally agreed within the language testing and assessment community…
Descriptors: Case Studies, Speech Communication, Language Tests, Comparative Analysis
Sidorov, Oleg V.; Kozub, Lyubov' V.; Goferberg, Alexander V.; Osintseva, Natalya V. – European Journal of Contemporary Education, 2018
The article discusses the methodological approach to the technology of the educational experiment performance, the ways of the research data processing by means of research methods and methods of mathematical statistics. The article shows the integrated use of some effective approaches to the training of the students majoring in…
Descriptors: Statistical Analysis, Technology Education, Laboratory Equipment, Technology Uses in Education
Chen, Huilin – Journal of Education and Learning, 2014
The validity of the computer-based language test is possibly affected by three factors: computer familiarity, audio-visual cognitive competence, and other discrepancies in construct. Therefore, validating the equivalence between the paper-and-pencil language test and the computer-based language test is a key step in the procedure of designing a…
Descriptors: Computer Assisted Testing, Language Tests, Test Validity, Case Studies
Colp, S. Mitchell; Nordstokke, David W. – Canadian Journal of School Psychology, 2014
Published by the Canadian Test Centre (CTC), "Insight" represents a group-administered test of cognitive functioning that has been built entirely upon the Cattell-Horn-Carroll (CHC) theoretical framework. "Insight" is intended to be administered by educators and screen entire classrooms for students who present learning…
Descriptors: Foreign Countries, Learning Disabilities, Intelligence Tests, Profiles
McCrimmon, Adam; Rostad, Kristin – Journal of Psychoeducational Assessment, 2014
This article reviews the "Autism Diagnostic Observation Schedule, Second Edition" (ADOS-2; Lord, Luyster, Gotham, & Guthrie, 2012; Lord, Rutter et al., 2012), a newly updated, semistructured, standardized measure of communication, social interaction, play/imagination, and restricted and/or repetitive behaviors published by Western…
Descriptors: Diagnostic Tests, Autism, Pervasive Developmental Disorders, Testing

