Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedMay, Kim; Nicewander, W. Alan – Journal of Educational Measurement, 1994
Reliabilities and information functions for percentile ranks and number-right scores were compared using item response theory, modeling standardized achievement tests. Results demonstrate that situations exist in which the percentage of items known by examinees can be accurately estimated, but the percentage of persons falling below a given score…
Descriptors: Achievement Tests, Difficulty Level, Equations (Mathematics), Estimation (Mathematics)
Peer reviewedSong, Li-yu; And Others – Psychological Assessment, 1994
Measurement fidelity (reliability, factor structure, and validity) of Aschenbach's Youth Self-Report scale was studied with 226 adolescents at a psychiatric hospital. Findings confirm convergent validity and reliability of four of the measure's seven narrowband syndromes, and seven meaningful subdimensions were extracted from the other three…
Descriptors: Adolescents, Factor Analysis, Factor Structure, Measurement Techniques
Peer reviewedReckase, Mark D. – Educational Measurement: Issues and Practice, 1995
An example application of portfolio assessment was developed and the model and estimates of reliability derived from the literature were then used to estimate the characteristics of an operational large-scale portfolio assessment program. Costs were estimated to put results in a realistic context. (SLD)
Descriptors: Cost Estimates, Educational Assessment, Educational Theories, Models
Peer reviewedHambleton, Ronald K.; Plake, Barbara S. – Applied Measurement in Education, 1995
Several extensions to the Angoff method of standard setting are described that can accommodate characteristics of performance-based assessment. A study involving 12 panelists supported the effectiveness of the new approach but suggested that panelists preferred an approach that was at least partially conjunctive. (SLD)
Descriptors: Educational Assessment, Evaluation Methods, Evaluators, Interrater Reliability
Feldman, Susan E. – Searcher, 1995
Examines the needs of Internet users, reviews the responses to these needs, and discusses the role information professionals play in using and organizing online information. User needs include: ease of use, compatibility, reliability, integration of Internet software, stability, affordability, centralized information-finding mechanisms, quality…
Descriptors: Computer Security, Computer Software, Information Scientists, Needs Assessment
Peer reviewedHigbee, Katherine R.; Roberts, Robert E. – Hispanic Journal of Behavioral Sciences, 1994
Eight-item revision of the UCLA Loneliness Scale was administered to 2,614 students, aged 11-14. Loneliness did not differ by age or between Anglo- and Mexican-American students, but was higher for girls than boys in each ethnic group. Principal components factor analysis and correlations with other related measures indicate good reliability and…
Descriptors: Affective Measures, Anglo Americans, Early Adolescents, Loneliness
Peer reviewedStumpf, Steven H. – Evaluation and the Health Professions, 1994
A five-year curriculum evaluation project is described that treated students' course ratings, examination reliability coefficients, and item-discrimination data as a battery of data points for determining annual revision efforts. Histograms were constructed to make valid demonstrations of successful efforts immediately comprehensible to faculty.…
Descriptors: College Faculty, Comprehension, Curriculum Evaluation, Longitudinal Studies
Peer reviewedAntonak, Richard F.; Larrivee, Barbara – Exceptional Children, 1995
Evidence supporting the use of a revision of the Opinions Relative to Mainstreaming scale, called Opinions Relative to Integration of Students with Disabilities, is presented. Scale testing with 376 professionals revealed satisfactory item characteristics, adequate reliability and homogeneity, and initial support for construct validity. The scale…
Descriptors: Attitude Measures, Disabilities, Elementary Secondary Education, Inclusive Schools
Peer reviewedDeBono, Kenneth G.; Snyder, Mark – Personality and Social Psychology Bulletin, 1995
Three investigations examined the contributions of a history of choosing attitudinally relevant situations to attitude-behavior relations. Results point to an interrelated set of mechanisms, such as behavior, by which situational choice is linked to attitude-behavior relations. By choosing attitudinally relevant situations, individuals increase…
Descriptors: Attitudes, Behavior, Behavior Development, Motivation
Peer reviewedGunn, Pat; Cuskelly, Monica – International Journal of Disability, Development and Education, 1991
Behavioral ratings by mothers and teachers of 94 children with Down's Syndrome (between 8 and 14 years of age) indicated general support for the amiable personality stereotype, but ratings of low persistence were associated with maternal impressions of difficulty. There was little agreement between mothers and teachers regarding individual child…
Descriptors: Adolescents, Behavior Problems, Children, Downs Syndrome
Peer reviewedChilders, Thomas; And Others – RQ, 1991
Reports on a project which developed a measure of levels of difficulty of reference questions handled by the California Interlibrary Reference Referral Network, and identified indicative measures that would reliably stand for the concept of difficulty. Correlations of predictive difficulty and actual difficulty are discussed. (nine references)…
Descriptors: Correlation, Difficulty Level, Library Networks, Library Services
Peer reviewedAllan, Alistair – Language Testing, 1992
The design of a valid and reliable test of test-wiseness is reported: a 33-item multiple-choice instrument with 4 subscales trialed with several groups of English-as-a-Second-Language students. Findings indicate differential skills in test-taking; some learner scores are influenced by skills that are not the focus of the test. (13 references)…
Descriptors: English (Second Language), Language Research, Language Tests, Multiple Choice Tests
Peer reviewedNorcini, John; Shea, Judy – Applied Measurement in Education, 1992
Two studies involving a total of 99 experts examined the reproducibility of standards for 2 medical certifying examinations set under different conditions. Together, results of both studies provide evidence that a modified version of the Angoff method is quite reliable and produces stable results under varying conditions. (SLD)
Descriptors: Academic Standards, Evaluators, Groups, Higher Education
Peer reviewedMartin, Mike – CD-ROM Professional, 1993
Discusses studies of CD-ROM life span done by 3M and the Federal Special Interest Group for CD-ROM Application and Technology (SIGCAT) and concludes that CD-ROMs have long life spans. A table presents test results, and industry specifications for CD-ROM discs and drivers are examined. (EA)
Descriptors: Disk Drives, Evaluation Methods, Industry, Optical Data Disks
Realmuto, George M.; Wescoe, Sibyl – Child Abuse and Neglect: The International Journal, 1992
This study investigated whether 13 young children presented with anatomically correct dolls would exhibit behaviors that professionals (n=14) could agree on to determine the child's abuse status. The study concluded that experienced professionals agree poorly with each other about a child's abuse status and that sexually anatomically correct dolls…
Descriptors: Behavior Patterns, Child Abuse, Clinical Diagnosis, Evaluation Methods


