Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
McCurry, Doug – Assessing Writing, 2010
This article considers the claim that machine scoring of writing test responses agrees with human readers as much as humans agree with other humans. These claims about the reliability of machine scoring of writing are usually based on specific and constrained writing tasks, and there is reason for asking whether machine scoring of writing requires…
Descriptors: Writing Tests, Scoring, Interrater Reliability, Computer Assisted Testing
Predictors of Bullying and Victimization in Childhood and Adolescence: A Meta-Analytic Investigation
Cook, Clayton R.; Williams, Kirk R.; Guerra, Nancy G.; Kim, Tia E.; Sadek, Shelly – School Psychology Quarterly, 2010
Research on the predictors of 3 bully status groups (bullies, victims, and bully victims) for school-age children and adolescents was synthesized using meta-analytic procedures. The primary purpose was to determine the relative strength of individual and contextual predictors to identify targets for prevention and intervention. Age and how…
Descriptors: Intervention, Bullying, Prevention, Victims of Crime
Schmitt, T. A.; Sass, D. A.; Sullivan, J. R.; Walker, C. M. – International Journal of Testing, 2010
Imposed time limits on computer adaptive tests (CATs) can result in examinees having difficulty completing all items, thus compromising the validity and reliability of ability estimates. In this study, the effects of speededness were explored in a simulated CAT environment by varying examinee response patterns to end-of-test items. Expectedly,…
Descriptors: Monte Carlo Methods, Simulation, Computer Assisted Testing, Adaptive Testing
Kubik, Joyce A. – Journal of Attention Disorders, 2010
Objective: This is perhaps the first outcome study on the efficacy of ADHD coaching for adults with ADHD and its long-term effect. Method: Forty-five adults (30 women, 15 men) rated 22 areas of concern before and after the coaching experience. Factor analysis of the 22 areas of concern revealed five factors. Descriptive statistics and…
Descriptors: Attention Deficit Hyperactivity Disorder, Factor Analysis, Statistical Analysis, Correlation
Newman, Diana B. – Communication Disorders Quarterly, 2010
This investigation examined the listening comprehension (LC) performance of two groups of adolescent struggling readers, one group with word-finding difficulties (WFD) and one with no word-finding difficulties (NWFD). Of interest was whether the expressive language difficulties of the WFD group would interfere with their success on a LC assessment…
Descriptors: Listening Comprehension, Adolescents, Expressive Language, Listening Comprehension Tests
Roszkowski, Michael J.; Soven, Margot – Assessment & Evaluation in Higher Education, 2010
A questionnaire used in student evaluations of interdisciplinary courses during six semesters contained two Likert items stated in a direct negative mode which were embedded in a questionnaire (14-18 items) in which the remaining items were phrased in a direct positive mode. In the seventh semester and thereafter, the two negative items were…
Descriptors: Questionnaires, Student Evaluation, Likert Scales, Test Construction
Eyler, Amy A.; Brownson, Ross C.; Aytur, Semra A.; Cradock, Angie L.; Doescher, Mark; Evenson, Kelly R.; Kerr, Jacqueline; Maddock, Jay; Pluto, Delores L.; Steinman, Lesley; Tompkins, Nancy O'Hara; Troped, Philip; Schmid, Thomas L. – Journal of School Health, 2010
Objectives: To develop a comprehensive inventory of state physical education (PE) legislation, examine trends in bill introduction, and compare bill factors. Methods: State PE legislation from January 2001 to July 2007 was identified using a legislative database. Analysis included components of evidence-based school PE from the Community Guide and…
Descriptors: Physical Education, Teacher Certification, Content Analysis, Trend Analysis
Skorupski, William P.; Carvajal, Jorge – Educational and Psychological Measurement, 2010
This study is an evaluation of the psychometric issues associated with estimating objective level scores, often referred to as "subscores." The article begins by introducing the concepts of reliability and validity for subscores from statewide achievement tests. These issues are discussed with reference to popular scaling techniques, classical…
Descriptors: Testing Programs, Test Validity, Achievement Tests, Scores
Pesman, Haki; Eryilmaz, Ali – Journal of Educational Research, 2010
The authors aimed to propose a valid and reliable diagnostic instrument by developing a three-tier test on simple electric circuits. Based on findings from the interviews, open-ended questions, and the related literature, the test was developed and administered to 124 high school students. In addition to some qualitative techniques for…
Descriptors: Misconceptions, Diagnostic Tests, Psychometrics, Physics
Topcu, Mustafa Sami – Evaluation & Research in Education, 2010
This study aimed to develop and validate the Attitudes towards Socioscientific Issues Scale (ATSIS) for undergraduate students. In the first step, data were collected from 160 undergraduate students from the departments of science education and elementary education to provide validity of the scale. In light of the results of an exploratory factor…
Descriptors: Science and Society, Attitude Measures, Student Attitudes, Undergraduate Students
Kocakulah, Mustafa Sabri – Journal of Science Education and Technology, 2010
This study aims to develop and apply a rubric to evaluate the solutions of pre-service primary science teachers to questions about Newton's Laws of Motion. Two groups were taught the topic using the same teaching methods and administered four questions before and after teaching. Furthermore, 76 students in the experiment group were instructed…
Descriptors: Control Groups, Scientific Concepts, Academic Achievement, Motion
Neuman, S.B.; Koh, S.; Dwyer, J. – Early Childhood Research Quarterly, 2008
The purpose of this study was to develop a valid and reliable tool for measuring the quality of the language and literacy environment in home-based settings. Based on a convergence of research on the ecological and psychological factors associated with early literacy development, the Child/Home Environmental Language and Literacy Observation…
Descriptors: Observation, Interrater Reliability, Urban Areas, Psychometrics
Gray, K. M.; Tonge, B. J.; Sweeney, D. J.; Einfeld, S. L. – Journal of Autism and Developmental Disorders, 2008
The ability to identify children who require specialist assessment for the possibility of autism at as early an age as possible has become a growing area of research. A number of measures have been developed as potential screening tools for autism. The reliability and validity of one of these measures for screening for autism in young children…
Descriptors: Check Lists, Autism, Interrater Reliability, Young Children
O'Sullivan, Maureen – Psychological Bulletin, 2008
In 2006, C. F. Bond Jr. and B. M. DePaulo provided a meta-analysis of means and concluded that average lie detection accuracy was significantly greater than chance for most people. Now, they have presented an analysis of standard deviations (C. F. Bond Jr. & B. M. DePaulo, 2008), claiming that there are no reliable individual differences in lie…
Descriptors: Deception, Test Theory, Meta Analysis, Individual Differences
Ratcliff, Roger; Schmiedek, Florian; McKoon, Gail – Intelligence, 2008
The worst performance rule for cognitive tasks [Coyle, T.R. (2003). IQ, the worst performance rule, and Spearman's law: A reanalysis and extension. "Intelligence," 31, 567-587] in which reaction time is measured is the result that IQ scores correlate better with longer (i.e., 0.7 and 0.9 quantile) reaction times than shorter (i.e., 0.1 and 0.3…
Descriptors: Reaction Time, Intelligence Quotient, Correlation, Models

Peer reviewed
Direct link
