Publication Date
| In 2026 | 3 |
| Since 2025 | 675 |
| Since 2022 (last 5 years) | 3176 |
| Since 2017 (last 10 years) | 7417 |
| Since 2007 (last 20 years) | 15055 |
Descriptor
| Test Reliability | 15043 |
| Test Validity | 10279 |
| Reliability | 9761 |
| Foreign Countries | 7144 |
| Test Construction | 4825 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3526 |
| Interrater Reliability | 3124 |
| Correlation | 3040 |
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
Location
| Turkey | 1328 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 217 |
| California | 215 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Muijs, Daniel – SAGE Publications, 2004
This book looks at quantitative research methods in education. The book is structured to start with chapters on conceptual issues and designing quantitative research studies before going on to data analysis. While each chapter can be studied separately, a better understanding will be reached by reading the book sequentially. This book is intended…
Descriptors: Multivariate Analysis, Multiple Regression Analysis, Correlation, Educational Research
Using Multiple Raters on Performance Based Driving Tests with High School Driver Education Students.
Haueisen, Heidi L. – 2001
An assessment tool was designed and implemented to increase consistent application among and between multiple raters assessing students in driver education. The targeted population was students in grades 9 through 12 enrolled in driver education at a high school in an affluent suburb near a large city. The problem of a lack of a consistent…
Descriptors: Driver Education, High School Students, High Schools, Interrater Reliability
Miller, M. David – 2002
In 1994 the State Collaborative on Assessment and Student Standards of the Council of Chief State School Officers began a study to examine the generalizability of performance-based assessments (PBAs) for state-mandated assessment programs. The intent was to examine the major sources of error associated with PBAs and the generalizability and…
Descriptors: Elementary Secondary Education, Error of Measurement, Generalizability Theory, Performance Based Assessment
Gardner, John; Cowan, Pamela – 2000
The Transfer Procedure Test is taken by children around 11 years of age who wish to attend grammar schools in Northern Ireland. It is a high stakes test in that children are only allowed one attempt and their performance determines their future schooling in a manner that is not of their choice or of their parents. Candidates usually take two test…
Descriptors: Admission (School), Elementary Secondary Education, Foreign Countries, High Stakes Tests
Dickinson, David K.; McCabe, Allyssa; Sprague, Kim – 2001
The Teacher Rating of Oral Language and Literacy (TROLL) is an instrument that measures skills identified as critical in the New Standards for Speaking and Listening. In 5 to 10 minutes and without prior training, teachers can assess an individual child's current standing with respect to skills that research has identified as critical for literary…
Descriptors: Early Childhood Education, Language Skills, Language Tests, Literacy
Basturk, Ramazan; Loadman, William E. – Online Submission, 2001
Purpose: The purpose of this study is to assess and evaluate the grant selection process for the reading excellence program in Ohio. School districts in Ohio were given the opportunity to apply for funding to support district-based reading programs through a request-for-proposal procedure. An effort was made to reliably and equitably score the…
Descriptors: Reading Programs, Tutors, Statistics, Grants
McQueen, Joy; Congdon, Peter J. – 1997
A study was conducted to investigate the stability of rater severity over an extended rating period. Multifaceted Rasch analysis was applied to ratings of writing performances of 8,285 primary school (elementary) students. Each performance was rated on two performance dimensions by two trained raters over a period of 7 rating days. Performances…
Descriptors: Educational Assessment, Elementary Education, Elementary School Students, Foreign Countries
Yorke, Mantz – 1997
This paper examines the purpose, validity, and reliability of performance indicators in higher education, focusing on the experience of higher education institutions in the United Kingdom. Specifically, it studies the role of student entry and exit performance, teaching and staff quality, retention and completion, and placement in employment as…
Descriptors: Educational Policy, Foreign Countries, Higher Education, Institutional Evaluation
Scheuren, Fritz; Li, Bonnie – 1996
This report provides empirical results of attempts to achieve consistency of estimates between two National Center for Education Statistics (NCES) surveys, the 1993-94 Private School Survey (PSS) and the Schools and Staffing Survey (SASS). Comparisons are made among statistical and computational procedures that may achieve the desired consistency…
Descriptors: Classification, Elementary Secondary Education, Estimation (Mathematics), Least Squares Statistics
Edman, Laird R. O.; Bart, William M.; Robey, Jennifer; Silverman, Jenzi – 2000
The Minnesota Test of Critical Thinking (MTCT) has been designed to measure both critical thinking (CT) skills and a key disposition of critical reasoning: the willingness to evaluate arguments that are congruent with one's own goals and beliefs critically. The MTCT uses a taxonomy of CT skills derived from the American Philosophical Association's…
Descriptors: Critical Thinking, Factor Analysis, Factor Structure, Higher Education
Sciutto, Mark J.; Terjesen, Mark D. – 2000
This study examined the psychometric and technical characteristics of various measures of attention deficit hyperactivity disorder (ADHD) that are commonly used with preschool-aged children. Information on reliability, validity, norms, and scale-specific features was gathered from the test manuals of four commonly used behavior rating scales: (1)…
Descriptors: Attention Deficit Disorders, Diagnostic Tests, Hyperactivity, Norms
Bastick, Tony – 1999
The purpose of this paper is to report a successful technique for assessing cooperative group work reliably and validly. The paper demonstrates a simple-to-use assessment procedure that tracks individual accountability, energizes student interaction, and rewards cooperative learning, even as it uses fewer administrative resources than traditional…
Descriptors: Accountability, Cooperative Learning, Criteria, Evaluation Methods
Bastick, Tony – 1999
This paper aims to make the techniques of cooperative learning more attractive to teachers by presenting a method of assessment that avoids the drawbacks associated with trying to extract valid and reliable individual marks from cooperative performances. The paper presents an easy-to-use method of assessing an individual's contribution to a…
Descriptors: Accountability, Cooperative Learning, Criteria, Evaluation Methods
Crehan, Kevin D.; Hess, Robert K.; D'Agostino, Jerome V. – 2000
This paper focuses on teacher testing issues related to job analysis, test specification development, reliability, and validity. It emphasizes the conceptualization and operational definition of appropriate validity evidence to assess the quality of licensure testing decisions. It is suggested that the process of job, or practice, analysis would…
Descriptors: Cognitive Processes, Job Analysis, Licensing Examinations (Professions), Reliability
Floreck, Lisa M.; De Champlain, Andre F.; Kaplan, David – 2001
The purpose of the current study was to use multilevel modeling to quantify and explain the sources of score variation in standardized patient (SP) encounters. Through laypersons trained to portray SPs and record medical student actions, SP examinations allow the measurement of examinees' clinical and interpersonal skills. In this study, the SP…
Descriptors: Clinical Experience, Computer Software, Licensing Examinations (Professions), Patients

