Publication Date
| In 2026 | 3 |
| Since 2025 | 675 |
| Since 2022 (last 5 years) | 3176 |
| Since 2017 (last 10 years) | 7417 |
| Since 2007 (last 20 years) | 15055 |
Descriptor
| Test Reliability | 15043 |
| Test Validity | 10279 |
| Reliability | 9761 |
| Foreign Countries | 7144 |
| Test Construction | 4825 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3526 |
| Interrater Reliability | 3124 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1328 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 217 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Miles, Eleanor; Sheeran, Paschal; Webb, Thomas L. – Psychological Bulletin, 2013
Augustine and Hemenover (2013) were right to state that meta-analyses should be accurate and generalizable. However, we disagree that our meta-analysis of emotion regulation strategies (Webb, Miles, & Sheeran, 2012) fell short in these respects. Augustine and Hemenover's concerns appear to have accrued from misunderstandings of our inclusion…
Descriptors: Effect Size, Meta Analysis, Accuracy, Self Control
Newton, Paul E. – Oxford Review of Education, 2013
In May 2008, Ofqual established a two-year programme of research to investigate the nature and extent of (un)reliability within the qualifications, examinations and assessments that it regulated. It was particularly concerned to improve understanding of, and confidence in, this technically complex and politically sensitive phenomenon. The…
Descriptors: Foreign Countries, Reliability, Educational Assessment, Case Studies
Bramley, Tom; Dhawan, Vikas – Research Papers in Education, 2013
This paper discusses the issues involved in calculating indices of composite reliability for "modular" or "unitised" assessments of the kind used in GCSEs, AS and A level examinations in England. The increasingly widespread use of on-screen marking has meant that the item-level data required for calculating indices of…
Descriptors: Foreign Countries, Exit Examinations, Secondary Education, Test Reliability
Ferrando, Pere J.; Anguiano-Carrasco, Cristina; Demestre, Josep – Structural Equation Modeling: A Multidisciplinary Journal, 2013
This article proposes a model-based procedure, intended for personality measures, for exploiting the auxiliary information provided by the certainty with which individuals answer every item (response certainty). This information is used to (a) obtain more accurate estimates of individual trait levels, and (b) provide a more detailed assessment of…
Descriptors: Structural Equation Models, Item Response Theory, Personality Measures, Goodness of Fit
Tseng, Andy – ProQuest LLC, 2013
Previous studies have reported that African-American students attending HBCUs consume less alcohol and experience fewer negative consequences compared to students attending other public colleges and universities (Fowler, 2001). Factors such as religion (Kapner, 2008), alcohol-free campus policies (Wechsler, Lee, Gledhill-Hoyt, & Nelson, 2001),…
Descriptors: College Students, African American Students, White Students, Drinking
Chae, Ki Byung – ProQuest LLC, 2013
The review of current supervision models and instruments revealed a crucial need for a valid, reliable instrument that assesses the quality of the supervision environment as a venue for promoting counselor development. Therefore, the primary purpose of this study was the construction and initial validation of the Chae Optimal Supervision…
Descriptors: Test Construction, Supervision, Tests, Test Validity
Baser, Mustafa – Online Submission, 2013
The aim of this research was to explore the relationship among students' attitudes toward programming, gender and academic achievement in programming. The scale used for measuring students' attitudes toward programming was developed by the researcher and consisted of 35 five-point Likert type items in four subscales. The scale was administered to…
Descriptors: Undergraduate Students, Student Attitudes, Computer Attitudes, Gender Differences
López-López, José Antonio; Botella, Juan; Sánchez-Meca, Julio; Marín-Martínez, Fulgencio – Journal of Educational and Behavioral Statistics, 2013
Since heterogeneity between reliability coefficients is usually found in reliability generalization studies, moderator analyses constitute a crucial step for that meta-analytic approach. In this study, different procedures for conducting mixed-effects meta-regression analyses were compared. Specifically, four transformation methods for the…
Descriptors: Reliability, Generalization, Meta Analysis, Regression (Statistics)
Thissen-Roe, Anne; Thissen, David – Journal of Educational and Behavioral Statistics, 2013
Extreme response set, the tendency to prefer the lowest or highest response option when confronted with a Likert-type response scale, can lead to misfit of item response models such as the generalized partial credit model. Recently, a series of intrinsically multidimensional item response models have been hypothesized, wherein tendency toward…
Descriptors: Likert Scales, Responses, Item Response Theory, Models
Yeh, Stuart S. – Teachers College Record, 2013
Background: In principle, value-added modeling (VAM) might be justified if it can be shown to be a more reliable indicator of teacher quality than existing indicators for existing low-stakes decisions that are already being made, such as the award of small merit bonuses. However, a growing number of researchers now advocate the use of VAM to…
Descriptors: Teacher Effectiveness, Academic Achievement, Teacher Placement, Teacher Dismissal
Cheng, Liying; Fox, Janna – Language Teaching, 2013
This paper reviews a selected sample of 24 doctoral dissertations in language assessment (broadly defined), completed between 2006 and 2011 in Canadian universities. These dissertations fall into five thematic categories: 1) reliability, validity and factors affecting test performance; 2) washback (impact) and ethics; 3) raters, rating and rating…
Descriptors: Foreign Countries, Doctoral Dissertations, Mixed Methods Research, Language Research
Campbell, Heather; Espin, Christine A.; McMaster, Kristen – Reading and Writing: An Interdisciplinary Journal, 2013
The purpose of this study was to examine the validity and reliability of Curriculum-Based Measures in writing for English learners. Participants were 36 high school English learners with moderate to high levels of English language proficiency. Predictor variables were type of writing prompt (picture, narrative, and expository), time (3, 5, and 7…
Descriptors: Curriculum Based Assessment, Writing Tests, Test Validity, Test Reliability
Chen, Ssu-Kuang; Hwang, Fang-Ming; Lin, Sunny S. J. – Social Indicators Research, 2013
A scale measuring quality of life (QOL) is important in adolescent research. Using the graded response model (GRM), this study evaluates the psychometric properties of the satisfaction ratings of the Quality of Life Profile Adolescent Version (QOLPAV). Data for 1,392 adolescents were used to check IRT assumptions such as unidimensionality and…
Descriptors: Quality of Life, Measures (Individuals), Life Satisfaction, Adolescents
Gustafsson, Jan-Eric; Erickson, Gudrun – Educational Assessment, Evaluation and Accountability, 2013
In the Swedish educational system, teachers have the dual responsibility of assigning final grades and marking their own students' national tests. The Government has mandated the Swedish Schools Inspectorate to remark samples of the national tests to see if teacher marking can be trusted. Reports from this project have concluded that intermarker…
Descriptors: Logical Thinking, Student Evaluation, Inferences, Trust (Psychology)
Ashley, Seth; Maksl, Adam; Craft, Stephanie – Journalism and Mass Communication Educator, 2013
Using a framework previously applied to other areas of media literacy, this study developed and assessed a measurement scale focused specifically on critical news media literacy. Our scale appears to successfully measure news media literacy as we have conceptualized it based on previous research, demonstrated through assessments of content,…
Descriptors: News Media, Role, Democracy, Citizenship Education

Peer reviewed
Direct link
