Publication Date
| In 2026 | 3 |
| Since 2025 | 675 |
| Since 2022 (last 5 years) | 3176 |
| Since 2017 (last 10 years) | 7417 |
| Since 2007 (last 20 years) | 15055 |
Descriptor
| Test Reliability | 15043 |
| Test Validity | 10279 |
| Reliability | 9761 |
| Foreign Countries | 7144 |
| Test Construction | 4825 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3526 |
| Interrater Reliability | 3124 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1328 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 217 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedJohnson, Robert L.; Penny, James; Gordon, Belita – Applied Measurement in Education, 2000
Studied four forms of score resolution used by testing agencies and investigated the effect that each has on the interrater reliability associated with the resulting operational scores. Results, based on 120 essays from the Georgia High School Writing Test, show some forms of resolution to be associated with higher reliability and some associated…
Descriptors: Essay Tests, High School Students, High Schools, Interrater Reliability
Peer reviewedBurton, Richard F.; Miller, David J. – Assessment & Evaluation in Higher Education, 1999
Discusses statistical procedures for increasing test unreliability due to guessing in multiple choice and true/false tests. Proposes two new measures of test unreliability: one concerned with resolution of defined levels of knowledge and the other with the probability of examinees being incorrectly ranked. Both models are based on the binomial…
Descriptors: Guessing (Tests), Higher Education, Multiple Choice Tests, Objective Tests
Feldhusen, John F.; Jin, Suk-un – Understanding Our Gifted, 2000
This article describes the Gifted Education Knowledge Scale, a rating scale for assessing educators' knowledge about gifted education. Information about item selection, content validity, reliability, and internal consistence is provided. (DB)
Descriptors: Elementary Secondary Education, Gifted, Knowledge Base for Teaching, Psychometrics
Peer reviewedRemington, Bob – Journal of Intellectual and Developmental Disability, 1998
This evaluative review describes the history of applied behavior analysis in the area of developmental disability and its strengths and weaknesses. Emphasis is placed on the fact that behavior analysis can continue to provide valuable insights into the education and treatment of people with mental retardation. (Author/CR)
Descriptors: Adults, Behavior Change, Behavior Modification, Behavior Problems
Peer reviewedCheung, Wing Ming; Cheng, Yin Cheong – Educational Research and Evaluation (An International Journal on Theory and Practice), 1998
An instrument developed to study teacher self-management was field-tested in 63 schools with 1,183 teachers. Teacher self-management was conceptualized as a self-iterative process with five stages. The analysis shows that the estimated reliability and predictive and construct validities of the instrument are satisfactory. (SLD)
Descriptors: Educational Research, Evaluation Methods, Foreign Countries, Reliability
Peer reviewedPonterotto, Joseph G.; Baluch, Suraiya; Carielli, Dominick – Measurement and Evaluation in Counseling and Development, 1998
The conceptual basis and development of The Suinn Lew Asian Self-Identity Acculturation Scale (SL-ASIA) are reviewed. This study of psychometric strengths and limitations includes 16 published empirical studies concerning reliability and validity of SL-ASIA. Measures of reliability and methods of establishing construct validity are discussed, and…
Descriptors: Acculturation, Counseling, Measures (Individuals), Meta Analysis
Peer reviewedCizek, Gregory J.; Robinson, K. Lynne; O'Day, Denis M. – Educational and Psychological Measurement, 1998
The effect of removing nonfunctioning items from multiple-choice tests was studied by examining change in difficulty, discrimination, and dimensionality. Results provide additional support for the benefits of eliminating nonfunctioning options, such as enhanced score reliability, reduced testing time, potential for broader domain sampling, and…
Descriptors: Difficulty Level, Multiple Choice Tests, Sampling, Scores
Peer reviewedWinston, Robert B., Jr.; Phelps, Rosemary E.; Mazzeo, Stephanie; Torres, Vasti – College Student Affairs Journal, 1997
Describes the development of the Georgia Autonomy Scales, an instrument used by student affairs professionals to assess the level of autonomy development of students. Estimates of the instrument's reliability and validity were judged sufficiently high to permit use with groups of students. (MKA)
Descriptors: Attitude Measures, College Students, Higher Education, Personal Autonomy
The Psychosocial Inventory of Ego Strengths: Development and Validation of A New Eriksonian Measure.
Peer reviewedMarkstrom, Carol A.; Sabino, Vicky M.; Turner, Bonnie J.; Berman, Rachel C. – Journal of Youth and Adolescence, 1997
The Psychosocial Inventory of Ego Strengths was developed to measure the ego strength concepts of E. Erikson. Two studies, involving 244 and 153 undergraduates in Canada, found evidence for the internal consistency of the eight ego strengths and the overall score, and convergent validity with some other measures of personal characteristics was…
Descriptors: Foreign Countries, Higher Education, Individual Characteristics, Personality Measures
Peer reviewedBall, Andrew M. – Infants and Young Children, 1998
Discusses how meta-analysis allows clinicians to determine objectively both presence and size of an effect or correlation within the existing literature by pooling the results of various studies and performing statistical analyses. Describes the risks and benefits of applying information obtained from meta-analysis into clinical practice.…
Descriptors: Developmental Disabilities, Effect Size, Meta Analysis, Reliability
Peer reviewedPenny, Jim; Johnson, Robert L.; Gordon, Belita – Assessing Writing, 2000
Defines a two-stage process by which a holistic rubric is applied to the assessment of open-ended items, such as writing samples. Indicates that the use of rating augmentation can improve the inter-rater reliability of holistic assessments, as indicated by generalizability phi coefficients, correlation coefficients, and percent agreement indices.…
Descriptors: Grade 5, Holistic Evaluation, Intermediate Grades, Reliability
Garan, Elaine M. – Phi Delta Kappan, 2001
The National Reading Panel admits its evaluation report on phonics is seriously flawed as to organization, methodology, appropriateness of research base, generalizability of results, reliability, validity, and accuracy of data reported. However, an influential public-relations machine is promoting the study's favorable results as unvarnished…
Descriptors: Elementary Education, Meta Analysis, Phonics, Program Evaluation
Peer reviewedJones, Keith; Sinkinson, Anne – Evaluation and Research in Education, 2000
Reports on the first round of inspection by the Office for Standards in Education (England) of providers of postgraduate certificates in education (PGCE) for secondary school mathematics. Almost three-quarters of the providers evaluated in these 21 reports were judged to be good or better, but problems were found with the consistency of the OFSTED…
Descriptors: Evaluation Methods, Foreign Countries, Mathematics, Reliability
Peer reviewedMatson, Johnny L.; Kuhn, David E. – Research in Developmental Disabilities, 2001
A study involving 570 individuals with mental retardation developed the Screening Tool of Feeding Problems, an assessment designed to identify feeding problems presented by persons with mental retardation, and thus facilitate the process of identifying who would benefit from some type of behavioral or medical intervention. Psychometric data are…
Descriptors: Adults, Children, Disability Identification, Eating Disorders
Peer reviewedFerrando, Pere J.; Lorenzo, Urbano; Molina, Gabriel – Applied Psychological Measurement, 2001
Developed an item response theory model of response stability based on the local independence principle. Tested the model, which predicts response changes under repeated administrations of the same instrument, with real data for 432 Spanish undergraduates. Results indicate that the model predictions are approximately fulfilled. (SLD)
Descriptors: Foreign Countries, Higher Education, Item Response Theory, Models


