Publication Date
| In 2026 | 7 |
| Since 2025 | 690 |
| Since 2022 (last 5 years) | 3191 |
| Since 2017 (last 10 years) | 7432 |
| Since 2007 (last 20 years) | 15070 |
Descriptor
| Test Reliability | 15055 |
| Test Validity | 10290 |
| Reliability | 9763 |
| Foreign Countries | 7150 |
| Test Construction | 4828 |
| Validity | 4192 |
| Measures (Individuals) | 3880 |
| Factor Analysis | 3826 |
| Psychometrics | 3532 |
| Interrater Reliability | 3126 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1329 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 224 |
| Spain | 218 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedPope, Raechele L.; Mueller, John A. – Journal of College Student Development, 2000
Discusses the development of the Multicultural Competence in Student Affairs-Preliminary 2 (MCSA-P2) Scale, an assessment tool to measure multicultural competence in a higher education context. Reports the results of two studies that investigated the validity and reliability of the MCSA-P2. Explores future research needs as well as applications…
Descriptors: Cultural Pluralism, Evaluation Methods, Higher Education, Measurement Techniques
Peer reviewedCoil, Carolyn – International Schools Journal, 2000
States that the Internet is one of the most commonly used methods of obtaining information for today's students. Provides guidelines for using the Internet as a reliable source of credible and accurate information. Lists nine general guidelines for reliability of e-mail sources, and 11 for web sources. (CW)
Descriptors: Educational Technology, Higher Education, Information Sources, Information Utilization
Peer reviewedJohnson, Robert L.; McDaniel, Fred, II; Willeke, Marjorie J. – American Journal of Evaluation, 2000
Studied the interrater reliability of a portfolio assessment used in a small-scale program evaluation. Investigated analytic, combined analytic, and holistic family literacy portfolios from an Even Start program. Results show that at least three raters are needed to obtain acceptable levels of reliability for holistic and individual analytic…
Descriptors: Family Literacy, Holistic Approach, Interrater Reliability, Portfolio Assessment
Peer reviewedAndre, Kate – Nurse Education Today, 2000
Although assessment of nursing students' clinical performance is difficult, objections to grading may be based on erroneous assumptions. A combination of criterion- and norm-referenced assessment can clarify minimum competency requirements and reward meritorious performance. (SK)
Descriptors: Clinical Experience, Competence, Criterion Referenced Tests, Foreign Countries
Peer reviewedStagnitti, Karen; Unsworth, Carolyn; Rodger, Sylvia – Canadian Journal of Occupational Therapy, 2000
A study of 82 preschoolers determined that a new play assessment (Child-Initiated Pretend Play Assessment), which identifies cognitive play skills, possessed acceptable interrater reliability and could discriminate between the play of typically developing preschoolers and those with preacademic problems. (Contains 65 references.) (JOW)
Descriptors: Behavior Problems, Cognitive Measurement, Interrater Reliability, Measures (Individuals)
Peer reviewedGordon Rouse, Kimberly A.; Cashin, Susan E. – Measurement and Evaluation in Counseling and Development, 2000
Examines the reliability and validity of the scores of the Assessment of Academic Self-Concept and Motivation Scale (AASCM), a new instrument developed according to motivational systems theory. The AASCM was administered to African American, European American, and Hispanic participants (N=492). Presents evidence of the measurement's strong…
Descriptors: College Students, Cross Cultural Studies, Evaluation Methods, Higher Education
Mudford, Oliver C.; Hogg, James; Roberts, Jessie – American Journal on Mental Retardation, 1999
A study attempted to replicate a previous study that presented reliability data from recordings of behavior state using a 13-category coding system. Replication was unsuccessful. Obtained mean percentage agreement on occurrence for individual behavior state and participants (n=34) ranged across observer pairs from 0 to 58 percent. (Contains 13…
Descriptors: Adults, Behavior Patterns, Behavior Problems, Behavior Rating Scales
Peer reviewedReid, Dennis H.; Parsons, Marsha B.; Green, Carolyn W. – Journal of Applied Behavior Analysis, 1998
A study evaluated a prework assignment for predicting work-task preferences among three adults with severe multiple disabilities prior to supported employment. The assessment compared worker selections from pairs of work tasks drawn from their future job duties. The assessment accurately predicted tasks that the workers preferred to work on.…
Descriptors: Adults, Job Performance, Severe Disabilities, Supported Employment
Peer reviewedDavidowitz, Bette; Lubben, Fred; Rollnick, Marissa – Journal of Chemical Education, 2001
Investigates students' conceptions on reliability and the ways of dealing with different sets of experimental data. Tests students' understanding of how to handle experimental data from three aspects; doing replicates, handling data, and judging the quality of data with respect to spread. Includes 12 references. (YDS)
Descriptors: Chemical Engineering, Chemistry, Data, Foreign Countries
Peer reviewedBurns, Mathew K. – Psychology in the Schools, 2002
Study reviewed measures that can be used for personality assessment with the Referral Question Consultation (RQC). Each review addressed the reliability of the scales and composite scores; validity; usefulness of results for planning interventions; empirical basis for analyses of individual items; and whether each item accounted for response…
Descriptors: Adolescents, Children, Counseling Techniques, Measurement
Carlson, Scott – Chronicle of Higher Education, 2002
Discusses how a Supreme Court decision last year in "The New York Times Company v. Jonathan Tasini" has led publishers to make massive purges of archival material in newspaper databases, rendering them unreliable for many scholars. (EV)
Descriptors: Court Litigation, Electronic Libraries, Full Text Databases, Higher Education
Peer reviewedHeylen, Louis; Wuyts, Floris L.; Mertens, Fons; De Bodt, Marc; Pattyn, Jos; Croux, Christophe; Van de Heyning, Paul H. – Journal of Speech, Language, and Hearing Research, 1998
Voice range profiles (VRP) were analyzed according to 11 frequency, intensity, and morphological characteristics for 94 typical children and 136 children with vocal fold pathologies (ages 6-11). Normative data are presented showing marked differences between the groups. The use of the VRP Index for Children for screening is discussed. (Author/CR)
Descriptors: Children, Disability Identification, Elementary Education, Screening Tests
Peer reviewedNicewander, W. Alan; Thomasson, Gary L. – Applied Psychological Measurement, 1999
Derives three reliability estimates for the Bayes modal estimate (BME) and the maximum-likelihood estimate (MLE) of theta in computerized adaptive tests (CATs). Computes the three reliability estimates and the true reliabilities of both BME and MLE for seven simulated CATs. Results show the true reliabilities for BME and MLE to be nearly identical…
Descriptors: Ability, Adaptive Testing, Bayesian Statistics, Computer Assisted Testing
Peer reviewedHayes, John R.; Hatch, Jill A.; Silk, Christine M. – Written Communication, 2000
Analyzes approximately 4,800 independent evaluations of 796 essays written by 241 students in 13 first-year writing classes at two colleges. Finds very low consistency of holistically scored student performance from essay to essay, suggesting that drawing conclusions from one or even a few writing samples of a particular student is problematic.…
Descriptors: Evaluation Problems, Higher Education, Holistic Evaluation, Reliability
Peer reviewedGuerette, Paula; Tefft, Donita; Furumasu, Jan; Moy, Fabiola – Infant-Toddler Intervention: The Transdisciplinary Journal, 1999
This study developed a test battery to assess the cognitive skills in children with physical limitations. A preliminary battery of 83 items was administered to 26 children, aged 26 to 36 months, with severe physical impairments. Rasch analysis yielded a final battery of 35 items with high internal consistency, interrater reliability, and…
Descriptors: Behavior Rating Scales, Cognitive Development, Cognitive Tests, Physical Disabilities


