Waller, Niels G. – Applied Psychological Measurement, 2008
Reliability is a property of test scores from individuals who have been sampled from a well-defined population. Reliability indices, such as coefficient alpha and related formulas for internal consistency reliability (KR-20, Hoyt's reliability), yield lower bound reliability estimates when (a) subjects have been sampled from a single population and when…
Descriptors: Test Items, Reliability, Scores, Psychometrics
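For readers unfamiliar with the internal consistency indices named in the abstract above, the following is a minimal sketch of the textbook coefficient alpha formula, not code from the cited article. The function name `cronbach_alpha` and the row-per-respondent input layout are assumptions made for illustration. When every item is scored 0/1, the same formula gives KR-20.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Coefficient alpha for a respondents-by-items score table.

    `scores` is a list of rows, one per respondent, each row holding
    that respondent's score on every item (hypothetical layout).
    """
    n_items = len(scores[0])
    if n_items < 2 or len(scores) < 2:
        raise ValueError("need at least two items and two respondents")

    # Sample variance of each item (one column per item).
    item_variances = [variance(column) for column in zip(*scores)]

    # Sample variance of the total (summed) score per respondent.
    total_variance = variance([sum(row) for row in scores])
    if total_variance == 0:
        raise ValueError("total scores have zero variance")

    # alpha = k / (k - 1) * (1 - sum of item variances / total variance)
    return (n_items / (n_items - 1)) * (1 - sum(item_variances) / total_variance)


if __name__ == "__main__":
    example = [[1, 2], [2, 1], [3, 3]]
    print(round(cronbach_alpha(example), 4))
```

The sketch treats all respondents as one sample; it does not address the mixed-population case the abstract raises.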
MacDougall, Margaret; Riley, Simon C.; Cameron, Helen S.; McKinstry, Brian – Journal of Applied Quantitative Methods, 2008
The authors introduce a consistency-based approach to detecting examiner bias. On comparing intra-class correlation coefficients on transformed data for supervisor continuous performance and report marks (ICC1*) with those for supervisor continuous performance and second marker report marks (ICC2*), a highly significant difference was obtained…
Descriptors: Medical Students, Attention, Correlation, Supervisors
Hargrove, Patricia; Griffer, Mona; Lund, Bonnie – Language, Speech, and Hearing Services in Schools, 2008
Purpose: This article provides information about clinical practice guidelines (CPGs) to facilitate their application to the practice of speech-language pathology. CPGs are sets of recommendations based on evidence, including expert clinical opinion, that have been developed by a panel of reviewers. In this article, CPGs are defined and their…
Descriptors: Graduate Students, Early Intervention, Interrater Reliability, Speech Language Pathology
Hendriks, A. A. Jolijn; Kuyper, Hans; Offringa, G. Johan; Van der Werf, Margaretha P. C. – Assessment, 2008
The Five-Factor Personality Inventory (FFPI) assesses a person's position on the (Dutch) psycholexically based Big Five factors: Extraversion, Agreeableness, Conscientiousness, Emotional Stability, and Autonomy. FFPI factor scores are reliable and valid if ratings are made by adults. The present study yields preliminary evidence of whether young…
Descriptors: Adolescents, Personality Measures, Validity, Reliability
Porter, Andrew C.; Polikoff, Morgan S.; Zeidner, Tim; Smithson, John – Educational Measurement: Issues and Practice, 2008
This article examines the reliability of content analyses of state student achievement tests and state content standards. We use data from two states in three grades in mathematics and English language arts and reading to explore differences by state, content area, grade level, and document type. Using a generalizability framework, we find that…
Descriptors: Content Analysis, Achievement Tests, State Standards, Academic Standards
Hagemann, Dirk; Meyerhoff, David – Structural Equation Modeling: A Multidisciplinary Journal, 2008
The latent state-trait (LST) theory is an extension of the classical test theory that allows one to decompose a test score into a true trait, a true state residual, and an error component. For practical applications, the variances of these latent variables may be estimated with standard methods of structural equation modeling (SEM). These…
Descriptors: Structural Equation Models, Test Theory, Reliability, Sample Size
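The decomposition described in the abstract above can be written out as follows. This is the standard latent state-trait formulation, given as a sketch; the symbols ($Y_{ik}$ for test $i$ on occasion $k$, $\xi$ for the trait, $\zeta$ for the state residual, $\varepsilon$ for error) are assumed notation and are not taken from the cited article.

```latex
\begin{align}
  Y_{ik} &= \xi_{ik} + \zeta_{ik} + \varepsilon_{ik}, \\
  \operatorname{Var}(Y_{ik}) &= \operatorname{Var}(\xi_{ik})
      + \operatorname{Var}(\zeta_{ik}) + \operatorname{Var}(\varepsilon_{ik}), \\
  \text{reliability} &=
      \frac{\operatorname{Var}(\xi_{ik}) + \operatorname{Var}(\zeta_{ik})}
           {\operatorname{Var}(Y_{ik})}.
\end{align}
```

The variance decomposition relies on the three components being mutually uncorrelated, which is how they are defined in the theory.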
Kaufman, James C.; Lee, Joohyun; Baer, John; Lee, Soonmook – Thinking Skills and Creativity, 2007
The consensual assessment technique (CAT) is a measurement tool for creativity research in which appropriate experts evaluate creative products [Amabile, T. M. (1996). "Creativity in context: Update to the social psychology of creativity." Boulder, CO: Westview]. However, the CAT is hampered by the time-consuming nature of the products (asking…
Descriptors: Creativity, Reliability, Generalizability Theory, Measurement Techniques
Parkes, Jay – Educational Measurement: Issues and Practice, 2007
Reliability consists of both important social and scientific values and methods for evidencing those values, though in practice methods are often conflated with the values. With the two distinctly understood, a reliability argument can be made that articulates the particular reliability values most relevant to the particular measurement situation…
Descriptors: Validity, Reliability, Evaluation Methods, Measurement
Amer, Aly; Al Barwani, Thuwayba; Ibrahim, Mahmoud – International Journal of Education and Development using Information and Communication Technology, 2010
Effective use of reading strategies has been recognized as an important means to increase reading comprehension. Many English as a Foreign Language (EFL) or English as a Second Language (ESL) studies have produced lists of paper-reading strategies (e.g. having a purpose for reading; using context clues). In contrast, few studies have investigated…
Descriptors: Foreign Countries, English (Second Language), Second Language Instruction, Second Language Learning
Herman, Joan L.; Osmundson, Ellen; Dietel, Ronald – Assessment and Accountability Comprehensive Center, 2010
The No Child Left Behind Act of 2001 (NCLB, 2002) has produced an explosion of interest in the use of assessment to measure and improve student learning. Initially focused on annual state tests, educators quickly learned that results came too little and too late to identify students who were falling behind. At the same time, evidence from the…
Descriptors: Federal Legislation, Formative Evaluation, Benchmarking, Educational Assessment
Cormier, Damien C.; Altman, Jason; Shyyan, Vitaliy; Thurlow, Martha L. – National Center on Educational Outcomes, University of Minnesota, 2010
The use of accommodations for both instruction and assessment continues to be of great importance for students with disabilities. The purpose of this report is to provide an update on the state of the research on testing accommodations, as well as to identify promising areas of research likely to contribute to understanding of current and emerging…
Descriptors: Testing Accommodations, Academic Achievement, Disabilities, Educational Research
Walker, Justin – Physics Education, 2010
The benefits of using data logging to teach "how science works" are presented. Pedagogical approaches that take advantage of other school ICT are briefly described. A series of simple, quick experiments are given together with their resulting charts. Examples of the questions that arise from the charts show how the rich data lead to the refinement…
Descriptors: Science Instruction, Physics, Laboratory Equipment, Water
Neukrug, Ed; Cicchetti, Richard; Forman, Julia; Kyser, Nicole; McBride, Rebecca; Wisinger, Sharon – Journal of Computing in Higher Education, 2010
This study examined the content of email messages to the listserv "CESNET-L" in order to identify trends, common themes, and "hot topics;" to clarify its purpose; and to offer suggestions for the future of CESNET-L and similar email lists in higher education. CESNET-L is an unmoderated listserv mostly used by counselor educators and doctoral…
Descriptors: Higher Education, Interrater Reliability, Counselor Training, Counseling
Wright, Robert E. – College Student Journal, 2010
The use of standardized tests for outcome assessment has grown dramatically in recent years. Two driving factors have been the No Child Left Behind legislation, and the increase in outcome assessment measures by accrediting agencies such as AACSB, the international accrediting body for business schools. Despite the growth in usage, little effort…
Descriptors: College Outcomes Assessment, Educational Testing, Standardized Tests, Accreditation (Institutions)
Evergreen, Stephanie D. H.; Robertson, Kelly N. – Journal of MultiDisciplinary Evaluation, 2010
Background: Cultural competency is an important but under-adopted skill among professional evaluators. Yet in the transactions around job seeking and hiring in evaluation, cultural competency is a practical and common concept. How cultural competency gets communicated in those transactions may provide insights for the field. Purpose: The purpose…
Descriptors: Job Applicants, Evaluators, Cultural Awareness, Career Centers