Publication Date
| In 2026 | 1 |
| Since 2025 | 599 |
| Since 2022 (last 5 years) | 2536 |
| Since 2017 (last 10 years) | 5571 |
| Since 2007 (last 20 years) | 9167 |
Descriptor
| Test Validity | 21743 |
| Test Reliability | 9997 |
| Test Construction | 5880 |
| Foreign Countries | 4941 |
| Psychometrics | 2956 |
| Factor Analysis | 2938 |
| Measures (Individuals) | 2370 |
| Higher Education | 2248 |
| Evaluation Methods | 2084 |
| College Students | 1810 |
| Correlation | 1722 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 728 |
| Practitioners | 429 |
| Teachers | 142 |
| Administrators | 96 |
| Policymakers | 57 |
| Counselors | 36 |
| Students | 20 |
| Parents | 13 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 2 |
| More ▼ | |
Location
| Turkey | 805 |
| Australia | 347 |
| Canada | 324 |
| China | 300 |
| United States | 188 |
| Indonesia | 170 |
| Spain | 168 |
| United Kingdom | 160 |
| Netherlands | 158 |
| California | 155 |
| Germany | 153 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 1 |
Peer reviewedCrocker, Linda; And Others – Journal of Educational Measurement, 1988
Using generalizability theory as a framework, the problem of assessing the content validity of standardized achievement tests is considered. Four designs to assess test-item fit to a curriculum are described, and procedures for determining the optimal number of raters and schools in a content-validation decision-making study are considered. (TJH)
Descriptors: Achievement Tests, Content Validity, Decision Making, Elementary Education
Peer reviewedKenkel, James M.; Tucker, Richard W. – World Englishes, 1989
Outlines an application of theoretical understandings of institutionalized or nativized varieties of English to the practical concern of English-as-a-Second-Language programs, including testing, placement, and pedagogy. (29 references) (Author/OD)
Descriptors: English (Second Language), Error Analysis (Language), Essays, Foreign Countries
Peer reviewedTenhaken, Ursula; Scheibner-Herzig, Gudrun – Journal of Experimental Education, 1988
Achievement in English was measured for 64 German eighth graders by a cloze test, which was evaluated by three methods, yielding similar results. Twenty-one native English speakers comprised a criterion group. The cloze test may be appropriate for evaluating oral communication when an oral interview is not feasible. (SLD)
Descriptors: Achievement Tests, Cloze Procedure, Communicative Competence (Languages), English (Second Language)
Peer reviewedSelby, Edwin C.; And Others – Journal of Creative Behavior, 1993
Innovative or adaptive behavior of 86 eighth-grade students was rated by themselves, parents, and teachers using the Kirton Adaption-Innovation Inventory. The inventory was found to be reliable, stable, and valid. No significant differences between male and female students' scores were exhibited. Correlation with scores on the Comprehensive Test…
Descriptors: Adjustment (to Environment), Adoption (Ideas), Basic Skills, Correlation
Peer reviewedAckerman, Terry A. – Journal of Educational Measurement, 1992
The difference between item bias and item impact and the way they relate to item validity are discussed from a multidimensional item response theory perspective. The Mantel-Haenszel procedure and the Simultaneous Item Bias strategy are used in a Monte Carlo study to illustrate detection of item bias. (SLD)
Descriptors: Causal Models, Computer Simulation, Construct Validity, Equations (Mathematics)
Widaman, Keith F.; And Others – American Journal on Mental Retardation, 1993
Measures of 4traits (cognitive competence, social competence, social maladaption, and personal maladaption) were obtained on 157 persons with mental retardation, using 3 measurements: standardized assessment instrument, day shift staff ratings, and evening shift staff ratings. The multitrait-multimethod matrix procedure demonstrated strong…
Descriptors: Adaptive Behavior (of Disabled), Behavior Rating Scales, Cognitive Ability, Construct Validity
Peer reviewedMoore, Don; And Others – Educational and Psychological Measurement, 1991
Correlations of National Teacher Examination (NTE) Core Battery scores and college grade point average (GPA) with a measure of teaching effectiveness for 493 first-year teachers indicate that the correlation is higher for GPA than for the Core Battery. NTE core scores do not predict effectiveness better than GPA alone. (SLD)
Descriptors: Beginning Teachers, College Graduates, Correlation, Elementary School Teachers
Peer reviewedMcLeod, P. J. – Evaluation and the Health Professions, 1991
Faculty opinions of an evaluation program for medical school clinical tutors were obtained through a survey of 24 undergraduate clinical tutors. Although students had been using the evaluation instrument to rate teachers for five years, faculty expressed many reservations about its reliability and validity. (SLD)
Descriptors: Clinical Teaching (Health Professions), Evaluation Methods, Higher Education, Medical Education
Peer reviewedNisbet, Steven – Mathematics Education Research Journal, 1991
Questionnaire responses of 155 student teachers were analyzed to develop meaningful attitude scales and to refine the instrument. Attitude scales identified in the analysis and built into the final form of the questionnaire were (1) anxiety; (2) confidence and enjoyment, (3) desire for recognition; and (4) pressure to conform. Includes a copy of…
Descriptors: Affective Measures, Attitude Measures, Education Majors, Elementary School Teachers
Peer reviewedBers, Trudy H.; Smith, Kerry E. – Community College Review, 1990
Describes a study of the validity and reliability of a writing skills assessment test taken by 4,284 2-year college students in 1986-87. Assesses interrater reliability, influences of nonperformance factors (e.g., gender, native language, and form of test), predictive validity of test for future performance, and implications of findings. (DMM)
Descriptors: Basic Writing, Community Colleges, High Risk Students, Predictive Validity
Peer reviewedStokes, Julie E.; And Others – Journal of Black Psychology, 1994
This paper investigates the psychometric properties of the African Self-Consciousness (ASC) Scale in a noncollege heterogeneous population of 147 African Americans to determine the reliability and validity of the ASC Scale. Based on analysis of the scale's reliability, factor structure, and construct validity, the study shows the ASC Scale to be a…
Descriptors: Behavior Rating Scales, Behavioral Science Research, Blacks, Construct Validity
Peer reviewedMerrell, Kenneth W. – School Psychology Review, 1993
Constructed School Social Behavior Scales (SSBS) to include teacher-related and peer-related forms of social competence and antisocial behavior. Standardized SSBS using teacher ratings on 1,858 kindergarten through grade 12 students across United States Evidence presented from several related studies in present investigation indicated that SSBS…
Descriptors: Antisocial Behavior, Behavior Rating Scales, Elementary School Students, Elementary Secondary Education
Peer reviewedZwick, Rebecca – Journal of Educational Statistics, 1993
A validity study with 5,219 students examined the degree to which Graduate Management Admission Test (GMAT) scores and undergraduate grade point average (GPA) could predict first-year average and final GPA in doctoral programs in business. The usefulness of the predictions derived from the empirical Bayes regression models is discussed. (SLD)
Descriptors: Administrator Education, Admission Criteria, Bayesian Statistics, Business Education
Peer reviewedGhuman, Jaswinder Kaur; Peebles, Claire D.; Ghuman, Harinder Singh – Infants and Young Children, 1998
A review of 36 social interaction measures found that there are no measures available to evaluate infants and preschool children's basic capacity for social interaction. The available measures are described and grouped into parent-child interaction, social skills, social competence, play, adaptive behavior, communication, general development, and…
Descriptors: Adaptive Behavior (of Disabled), Behavior Problems, Emotional Disturbances, Evaluation Methods
Peer reviewedO'Neil, Harold F.; Abedi, Jamal – Journal of Educational Research, 1996
Describes research on the development of a measure of student metacognition. The brief, domain-independent measure serves as a collateral measure in construct validation, supporting exploration of the self-regulatory demands of performance assessment. Results show that metacognition can be directly and explicitly measured in the context of…
Descriptors: Alternative Assessment, Cognitive Ability, College Students, Elementary Secondary Education


