Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Ali, Syed Haris; Carr, Patrick A.; Ruit, Kenneth G. – Journal of the Scholarship of Teaching and Learning, 2016
Plausible distractors are important for accurate measurement of knowledge via multiple-choice questions (MCQs). This study demonstrates the impact of higher distractor functioning on validity and reliability of scores obtained on MCQs. Free-response (FR) and MCQ versions of a neurohistology practice exam were given to four cohorts of Year 1 medical…
Descriptors: Scores, Multiple Choice Tests, Test Reliability, Test Validity
Yasar, Erkan; Gurel, Cem – European Journal of Physics Education, 2016
This research aims to measure, via a knowledge hierarchy, what visitors learned about exhibit themes and to compare it with the purpose for which the exhibits were designed, thereby carrying out a summative evaluation of the exhibits by the knowledge hierarchy method. The research has been conducted in a children's…
Descriptors: Science Education, Museums, Exhibits, Summative Evaluation
Ulum, Ömer Gökhan – Online Submission, 2016
The aim of this study is to evaluate a state high school EFL program through the CIPP (context, input, process, and product) model. The participants of the study include 504 students. The data were obtained through a 46-item questionnaire and a student interview. In the study, the data have been analysed using statistical…
Descriptors: Program Evaluation, English (Second Language), Second Language Learning, Second Language Instruction
Dean, Lynn M. – ProQuest LLC, 2016
How parents interact with their children impacts many crucial facets of children's lives. Over the last 4 decades, researchers have identified four different parenting styles: authoritative, authoritarian, permissive, and disengaged. Hundreds of studies conducted all over the world have identified correlations between parenting style and many…
Descriptors: Parent Attitudes, Questionnaires, Reliability, Generalization
August, Diane; Slama, Rachel – Office of English Language Acquisition, US Department of Education, 2016
This literature review begins with a description of the process used to conduct the literature review, parameters for the review, and the characterization of the literature. The body of this review consists of four sections: (1) Development and/or Adoption of State English Language Proficiency (ELP) Standards, (2) Design and Development of the ELP…
Descriptors: Accountability, English Language Learners, State Standards, Language Proficiency
Pinder, Patrice Juliet – Online Submission, 2020
States are establishing high stakes assessments to serve as measurement tools of students' academic abilities. This study essentially compares Maryland's and Florida's mathematics and science assessments for similarities and differences. Building from 5-10 years of student level quantitative data (secondary data) and critical analyses of the…
Descriptors: Standardized Tests, Achievement Tests, State Standards, High Stakes Tests
Miller, Angie L. – Creativity Research Journal, 2014
This study sought to explore creative cognitive processes and the similarities and differences in how descriptions of these processes group together in various self-report subscales. Based on empirical evidence from numerous studies involving the cognitive components of creativity training, the Cognitive Processes Associated with Creativity (CPAC)…
Descriptors: Cognitive Processes, Creativity, Creative Thinking, Validity
Medrano, Leonardo Adrian; Liporace, Mercedes Fernandez; Perez, Edgardo – Electronic Journal of Research in Educational Psychology, 2014
Introduction: Computerized tests have become one of the most widely used and efficient educational assessment methods. Increasing efforts to generate computerized assessment systems to identify students at risk for drop out have been recently noted. An important variable influencing student retention is academic satisfaction. Accordingly, the…
Descriptors: Computer Assisted Testing, Satisfaction, College Freshmen, Self Efficacy
Cheng, Sanyin; Zhang, Li-Fang – American Annals of the Deaf, 2014
The present study pioneered in adopting test accommodations to validate the Thinking Styles Inventory-Revised II (TSI-R2; Sternberg, Wagner, & Zhang, 2007) among Chinese university students with hearing impairment. A series of three studies were conducted that drew their samples from the same two universities, in which accommodating test…
Descriptors: Foreign Countries, Testing Accommodations, Hearing Impairments, College Students
Sink, Christopher A.; Bultsma, Shawn A. – Measurement and Evaluation in Counseling and Development, 2014
The psychometric properties of the Life Perspectives Inventory (LPI-English language version), a new instrument designed to assess characteristics associated with nonreligious spirituality in high school-age adolescents, were examined in two phases. Phase 1 demonstrated the survey's factorial validity and internal consistency and the test-retest…
Descriptors: Measures (Individuals), Adolescents, Religious Factors, High School Students
Iovannone, Rose; Greenbaum, Paul E.; Wang, Wei; Dunlap, Glen; Kincaid, Don – Assessment for Effective Intervention, 2014
Data assessment is critical for determining student behavior change in response to individualized behavior interventions in schools. This study examined the interrater agreement of the Individualized Behavior Rating Scale Tool (IBRST), a perceptual direct behavior rating tool that was used by typical school personnel to record behavior occurrence…
Descriptors: Behavior Rating Scales, Interrater Reliability, Student Behavior, Behavior Problems
Loats, Jim; White, Diana; Rubino, Carmen – PRIMUS, 2014
We provide in-depth information and analysis on three activities for use in History of Mathematics courses taught either in a traditional semester format for undergraduates or in a summer professional development course for middle school teachers. These activities require students to be active participants in their own learning. They also…
Descriptors: Undergraduate Study, College Mathematics, Mathematics Instruction, History
Beaujean, A. Alexander – Practical Assessment, Research & Evaluation, 2014
A common question asked by researchers using regression models is, What sample size is needed for my study? While there are formulae to estimate sample sizes, their assumptions are often not met in the collected data. A more realistic approach to sample size determination requires more information such as the model of interest, strength of the…
Descriptors: Regression (Statistics), Sample Size, Sampling, Monte Carlo Methods
Nadelson, Louis; Jorcyk, Cheryl; Yang, Dazhi; Jarratt Smith, Mary; Matson, Sam; Cornell, Ken; Husting, Virginia – School Science and Mathematics, 2014
Trust in science and scientists can greatly influence consideration of scientific developments and activities. Yet, trust is a nebulous construct based on emotions, knowledge, beliefs, and relationships. As we explored the literature regarding trust in science and scientists we discovered that no instruments were available to assess the construct,…
Descriptors: Trust (Psychology), Test Construction, College Faculty, Undergraduate Students
Meyer, J. Patrick; Liu, Xiang; Mashburn, Andrew J. – Educational and Psychological Measurement, 2014
Researchers often use generalizability theory to estimate relative error variance and reliability in teaching observation measures. They also use it to plan future studies and design the best possible measurement procedures. However, designing the best possible measurement procedure comes at a cost, and researchers must stay within their budget…
Descriptors: Reliability, Classroom Observation Techniques, Generalizability Theory, Error of Measurement