Publication Date
| In 2026 | 5 |
| Since 2025 | 627 |
| Since 2022 (last 5 years) | 2564 |
| Since 2017 (last 10 years) | 5599 |
| Since 2007 (last 20 years) | 9195 |
Descriptor
| Test Validity | 21771 |
| Test Reliability | 10011 |
| Test Construction | 5891 |
| Foreign Countries | 4955 |
| Psychometrics | 2963 |
| Factor Analysis | 2941 |
| Measures (Individuals) | 2377 |
| Higher Education | 2250 |
| Evaluation Methods | 2085 |
| College Students | 1813 |
| Correlation | 1723 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 728 |
| Practitioners | 429 |
| Teachers | 142 |
| Administrators | 96 |
| Policymakers | 57 |
| Counselors | 36 |
| Students | 20 |
| Parents | 13 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 2 |
| More ▼ | |
Location
| Turkey | 807 |
| Australia | 347 |
| Canada | 324 |
| China | 300 |
| United States | 188 |
| Indonesia | 172 |
| Spain | 169 |
| United Kingdom | 160 |
| Netherlands | 159 |
| California | 156 |
| Germany | 153 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 1 |
Peer reviewedBachman, Lyle F. – Language Testing, 2000
Reviews developments in language testing research and practice over the last 20 years, and suggests future directions in the areas of professionalizing the field and validation research. Argues that concerns for ethical conduct must be grounded in valid test use, so that professionalization and validation research are inseparable. (Author/VWL)
Descriptors: Ethics, Language Research, Language Tests, Second Language Instruction
Peer reviewedKoretz, Daniel; Stecher, Brian; Klein, Stephen; McCaffrey, Daniel – Educational Measurement: Issues and Practice, 1994
Reports on an ongoing evaluation of the Vermont portfolio assessment program. Indicates that the positive news about the instructional effects of the assessment program are in contrast with the empirical findings about the quality of the data the program has yielded. (SLD)
Descriptors: Accountability, Elementary Secondary Education, Performance Based Assessment, Portfolio Assessment
Peer reviewedSmith, Tina T.; Lee, Evan; McDade, Hiram L. – Communication Disorders Quarterly, 2001
This study investigated the dialectal sensitivity of the T-unit as a nonbiased alternative for assessing the oral grammatical skills of school-age, nonstandard English speakers. Analysis of language samples from 28 9-year-old children (half African-American) revealed no significant differences between groups, suggesting that the T-unit may be a…
Descriptors: Black Dialects, Black Students, Culture Fair Tests, Elementary Education
Peer reviewedMcCracken, Nancy Mellin; McCracken, Hugh Thomas – English Journal, 2001
Asks several teachers what they have lost from their teaching or their classroom since the growth in mandated, standardized testing. Considers the ill effects of mandated testing, and names some educational essentials at risk of being lost while testing rules. Discusses what is lost in high-stakes multiple-choice testing of new teachers. (SG)
Descriptors: High Stakes Tests, Higher Education, Preservice Teachers, Secondary Education
Naevdal, F. – Journal of Adolescence, 2005
The article presents a psychometric description of 11 statements related to use of physical violence. The items were tested in a normal sample (N=1700, age: 15-16) from urban and rural areas in Western Norway. The internal reliability was @a=0.86, and the factor analysis resulted in two factors. Boys had higher mean scores than girls.…
Descriptors: Test Reliability, Predictor Variables, Test Validity, Gender Differences
Wilson, Coralie J.; Deane, Frank P.; Ciarrochi, Joseph; Rickwood, Debra – Canadian Journal of Counselling, 2005
Understanding help seeking intentions and behaviour is fundamental to the identification of factors that can be modified to increase engagement in counselling. Despite considerable research on these variables, integrating prior research has been impeded by a lack of consistent and psychometrically sound help-seeking measures. The General…
Descriptors: Intention, Measures (Individuals), Help Seeking, Student Attitudes
Phillips, Julia C.; Szymanski, Dawn M.; Ozegovic, Jelena Jovanovic; Briggs-Phillips, Melissa – Journal of Counseling Psychology, 2004
Consistent with C. J. Gelso's (1979, 1993, 1997) research training environment theory, the authors hypothesized that research training environments exist in predoctoral internships. The Internship Research Training Environment Scale (IRTES) was developed to assess research training environments found in predoctoral psychology internships.…
Descriptors: Educational Environment, Guidance Centers, Predictive Validity, Counseling Services
Malofeeva, Elena; Day, Jeanne; Saco, Ximena; Young, Laura; Ciancio, Dennis – Journal of Educational Psychology, 2004
The reliability and, to a lesser extent, the validity of the newly created Number Sense Test was evaluated with a group of 40 3- to 5-year-old children attending Head Start. Six number sense skills (e.g., counting, number identification, addition-subtraction) and children's feelings about school were assessed both before and after instruction…
Descriptors: Disadvantaged Youth, Student Attitudes, Preschool Children, Mathematics Skills
Hodges, Timothy D.; Harter, James K. – Educational Horizons, 2005
StrengthsQuest is a student program that focuses on strengths rather than weaknesses. It is intended to lead students to discover their natural talents and gain unique and valuable insights into how to develop such talents into strengths--strengths that equip them to succeed and to make important decisions that enable them to balance the demands…
Descriptors: Test Reliability, Test Validity, Talent Development, Student Empowerment
Weinstock, Jeremiah; Whelan, James P.; Meyers, Andrew W. – Psychological Assessment, 2004
The Gambling Timeline Followback (G-TLFB), a measure of gambling behavior that uses the timeline followback methodology, was psychometrically evaluated with samples of frequent-gambling young adults. Seven dimensions of gambling behavior were assessed: type, frequency, duration, intent, risk, win-loss, and consumption of alcohol while gambling.…
Descriptors: Young Adults, Test Validity, Measures (Individuals), Behavior Patterns
Rodebaugh, Thomas L.; Woods, Carol M.; Thissen, David M.; Heimberg, Richard G.; Chambless, Dianne L.; Rapee, Ronald M. – Psychological Assessment, 2004
Statistical methods designed for categorical data were used to perform confirmatory factor analyses and item response theory (IRT) analyses of the Fear of Negative Evaluation scale (FNE; D. Watson & R. Friend, 1969) and the Brief FNE (BFNE; M. R. Leary, 1983). Results suggested that a 2-factor model fit the data better for both the FNE and the…
Descriptors: Measures (Individuals), Validity, Item Response Theory, Fear
Gersten, Russell; Baker, Scott K.; Haager, Diane; Graves, Anne W. – Remedial & Special Education, 2005
The first portion of this article describes the development and validation of a classroom observation measure. The goal of the measure was to assess the quality of reading instruction provided to first-grade English learners. We report the internal consistency reliability, interrater reliability, the development of empirically derived subscales,…
Descriptors: Second Language Learning, English (Second Language), Reading Instruction, Teacher Effectiveness
Shaftel, Julia; Yang, Xiangdong; Glasnapp, Douglas; Poggio, John – Educational Assessment, 2005
A test designed with built-in modifications and covering the same grade-level mathematics content provided more precise measurement of mathematics achievement for lower performing students with disabilities. Fourth-grade students with disabilities took a test based on modified state curricular standards for their mandated statewide mathematics…
Descriptors: Disabilities, Mathematics Achievement, Item Response Theory, Mathematics Tests
Jung, Lee Ann; McWilliam, R. A. – Journal of Early Intervention, 2005
Evidence is presented regarding the construct validity and internal consistency reliability of scores for an investigator-developed individualized family service plan (IFSP) rating scale. One hundred and twenty IFSPs were rated using a 12-item instrument, the IFSP Rating Scale (McWilliam & Jung, 2001). Using principal components factor…
Descriptors: Test Validity, Rating Scales, Factor Analysis, Construct Validity
Kucuker, Sevgi; Acarlar, Funda; Kapci, Emine G. – Early Child Development and Care, 2006
This study aimed to develop a new scale, the "Supports Scale For Preschool Inclusion" (SSPI), to assess preschool teachers perceptions of necessary factors and availability of supports for a successful inclusion in pre-school educational settings. Pre-school teachers ("n" = 183, mean age = 32.81, standard deviation = 8.29) from…
Descriptors: Foreign Countries, Measures (Individuals), Psychometrics, Preschool Teachers

Direct link
