Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Denson, Cameron D.; Buelin, Jennifer K.; Lammi, Matthew D.; D'Amico, Susan – Journal of Technology Education, 2015
A perceived inability to assess creative attributes of students' work has often precluded creativity instruction in the classroom. The Consensual Assessment Technique (CAT) has shown promise in a variety of domains for its potential as a valid and reliable means of creativity assessment. Relying upon an operational definition of creativity and a…
Descriptors: Creativity, Engineering Education, Engineering, Creative Thinking
Razi, Salim – SAGE Open, 2015
Similarity reports of plagiarism detectors should be approached with caution as they may not be sufficient to support allegations of plagiarism. This study developed a 50-item rubric to simplify and standardize evaluation of academic papers. In the spring semester of 2011-2012 academic year, 161 freshmen's papers at the English Language Teaching…
Descriptors: Foreign Countries, Scoring Rubrics, Writing Evaluation, Writing (Composition)
Feinberg, Richard A.; Wainer, Howard – Educational Measurement: Issues and Practice, 2014
Subscores can be of diagnostic value for tests that cover multiple underlying traits. Some items require knowledge or ability that spans more than a single trait. It is thus natural for such items to be included on more than a single subscore. Subscores only have value if they are reliable enough to justify conclusions drawn from them and if they…
Descriptors: Scores, Test Items, Reliability
Sanders, Joe Sutliff – Children's Literature in Education, 2015
A recent surge of conversation about children's nonfiction reveals a conflict between two positions that do not at first appear to be opposed: modeling inquiry and presenting authoritative facts. Tanya Lee Stone, the author of the Sibert Award-winning "Almost Astronauts" (2009), has recently alluded to that tension and expressed a…
Descriptors: Childrens Literature, Nonfiction, Authors, Inquiry
Lathrop, Quinn N. – Practical Assessment, Research & Evaluation, 2015
There are two main lines of research in estimating classification accuracy (CA) and classification consistency (CC) under Item Response Theory (IRT). The R package cacIRT provides computer implementations of both approaches in an accessible and unified framework. Even with available implementations, there remains decisions a researcher faces when…
Descriptors: Classification, Accuracy, Item Response Theory, Reliability
Stevens, Christopher John; Dascombe, Ben James – Measurement in Physical Education and Exercise Science, 2015
Sports performance testing is one of the most common and important measures used in sport science. Performance testing protocols must have high reliability to ensure any changes are not due to measurement error or inter-individual differences. High validity is also important to ensure test performance reflects true performance. Time-trial…
Descriptors: Athletics, Test Reliability, Test Validity, Testing
Chin, Huan; Chew, Cheng Meng; Lim, Hooi Lian; Thien, Lei Mee – International Journal of Science and Mathematics Education, 2022
Cognitive Diagnostic Assessment (CDA) is an alternative assessment which can give a clear picture of pupils' learning process and cognitive structures to education stakeholders so that appropriate instructional strategies can be designed to tailored pupils' needs. Coincide with this function, the Ordered Multiple-Choice (OMC) items were…
Descriptors: Mathematics Instruction, Mathematics Tests, Multiple Choice Tests, Diagnostic Tests
Srour, F. Jordan; Karkoulian, Silva – International Journal of Social Research Methodology, 2022
The literature provides multiple measures of diversity along a single demographic dimension, but when it comes to studying the interaction of multiple diversity types (e.g. age, gender, and race), the field of useable measures diminishes. We present the use of decision trees as a machine learning technique to automatically identify the…
Descriptors: Diversity, Decision Making, Artificial Intelligence, Correlation
Huang, Ting; Steinkrauss, Rasmus; Verspoor, Marjolijn – International Journal of Multilingualism, 2022
There is quite a bit of evidence showing that the experience of learning an L2 will help in learning an L3, but as far as we know, very little research has investigated the possible impact of L3 learning on the already existing and still developing L2 system within the learner. According to Complex Dynamic Systems Theory (CDST), language…
Descriptors: Multilingualism, Second Language Learning, Second Language Instruction, Transfer of Training
New York State Education Department, 2022
The instructions in this manual explain the responsibilities of school administrators for the New York State Testing Program (NYSTP) Grades 3-8 English Language Arts and Mathematics Field Tests, and the Elementary-level (Grade 5) and Intermediate-level (Grade 8) Science Field Tests. School administrators must be thoroughly familiar with the…
Descriptors: Testing Programs, Mathematics Tests, Test Format, Computer Assisted Testing
Mary-anne Macdonald; Eyal Gringart – Australian Journal of Indigenous Education, 2022
Research in Indigenous and non-Indigenous education in Australia over the last two decades has begun to turn towards quantitative methods of understanding various factors affecting student outcomes. The current article presents a new measurement instrument, the Multi-Dimensional Student Perceptions of School Questionnaire (MSPSQ), validated with a…
Descriptors: Test Validity, Student Attitudes, Student School Relationship, Questionnaires
Eslit, Edgar R. – Online Submission, 2023
This study examines the adaptations and experiences of college-level language learners in the post-pandemic era, providing valuable insights into the transformative nature of language learning in an evolving world. Within the framework of the socio-cultural perspective, this study explores the interplay between technology, self-directed learning,…
Descriptors: Second Language Learning, Second Language Instruction, COVID-19, Pandemics
Forrow, Lauren; Starling, Jennifer; Gill, Brian – Regional Educational Laboratory Mid-Atlantic, 2023
The Every Student Succeeds Act requires states to identify schools with low-performing student subgroups for Targeted Support and Improvement or Additional Targeted Support and Improvement. Random differences between students' true abilities and their test scores, also called measurement error, reduce the statistical reliability of the performance…
Descriptors: At Risk Students, Low Achievement, Error of Measurement, Measurement Techniques
Regional Educational Laboratory Mid-Atlantic, 2023
This Snapshot highlights key findings from a study that used Bayesian stabilization to improve the reliability (long-term stability) of subgroup proficiency measures that the Pennsylvania Department of Education (PDE) uses to identify schools for Targeted Support and Improvement (TSI) or Additional Targeted Support and Improvement (ATSI). The…
Descriptors: At Risk Students, Low Achievement, Error of Measurement, Measurement Techniques
Regional Educational Laboratory Mid-Atlantic, 2023
The "Stabilizing Subgroup Proficiency Results to Improve the Identification of Low-Performing Schools" study used Bayesian stabilization to improve the reliability (long-term stability) of subgroup proficiency measures that the Pennsylvania Department of Education (PDE) uses to identify schools for Targeted Support and Improvement (TSI)…
Descriptors: At Risk Students, Low Achievement, Error of Measurement, Measurement Techniques

Peer reviewed
Direct link
