Publication Date
| In 2026 | 0 |
| Since 2025 | 28 |
| Since 2022 (last 5 years) | 154 |
| Since 2017 (last 10 years) | 452 |
| Since 2007 (last 20 years) | 703 |
Descriptor
| Construct Validity | 894 |
| Test Reliability | 894 |
| Foreign Countries | 410 |
| Factor Analysis | 391 |
| Test Validity | 370 |
| Test Construction | 347 |
| Psychometrics | 264 |
| Measures (Individuals) | 220 |
| Factor Structure | 217 |
| Questionnaires | 147 |
| Correlation | 116 |
Audience
| Researchers | 20 |
| Practitioners | 4 |
| Students | 1 |
| Teachers | 1 |
Location
| Turkey | 126 |
| Indonesia | 24 |
| China | 17 |
| Australia | 16 |
| Malaysia | 14 |
| Florida | 10 |
| Greece | 10 |
| Iran | 10 |
| Taiwan | 10 |
| Netherlands | 9 |
| Spain | 9 |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 2 |
| Early Head Start | 1 |
| Education Consolidation… | 1 |
| Elementary and Secondary… | 1 |
| Elementary and Secondary… | 1 |
| Individuals with Disabilities… | 1 |
Ward, William C.; And Others – 1986
The keylist format (rather than the conventional multiple-choice format) for item presentation provides a machine-scorable surrogate for a truly free-response test. In this format, the examinee is required to think of an answer, look it up in a long ordered list, and enter its number on an answer sheet. The introduction of keylist items into…
Descriptors: Analogy, Aptitude Tests, Construct Validity, Correlation
Murphy, Christine A.; And Others – 1988
A 32-item Computer Self-Efficacy Scale (CSE) was developed to measure perceptions of capability regarding specific computer-related knowledge and skills. Bandura's theory of self-efficacy (1986) and Schunk's model of classroom learning (1985) guided the development of the CSE. Each of the skill-related items is preceded by the phrase "I feel…
Descriptors: Adult Vocational Education, Computer Literacy, Construct Validity, Graduate Students
Copley, Lisa D.; Meehan, Merrill L.; Howley, Caitlin W.; Hughes, Georgia K. – Appalachia Educational Laboratory at Edvantia (NJ1), 2005
The major purpose of the second field test of the AEL MSCI instrument was to assess the psychometric properties of the refined version with a larger, more diverse group of respondents. The first objective of this field test was to expand the four-point Likert-type response scale to six points in order to yield more variance in responses. The…
Descriptors: Field Tests, Educational Improvement, Evaluation Methods, Teachers
Center for Innovation in Assessment (NJ1), 2005
Research was conducted to evaluate how well the "Indiana Reading Assessment--Grade 1" evaluates various reading skills of grade one students. Multiple analyses were conducted; while the results of all the analyses were encouraging, the results derived from the concurrent validity study were most significant. All the correlations were…
Descriptors: Reading Tests, Test Validity, Test Reliability, Interrater Reliability
Center for Innovation in Assessment (NJ1), 2005
Research was conducted to evaluate how well the "Indiana Reading Assessment--Grade 2" evaluates various reading skills of grade two students. Multiple analyses were conducted; while the results of all the analyses were encouraging, the results derived from the concurrent validity study were most significant. Correlations were either…
Descriptors: Reading Tests, Test Validity, Test Reliability, Interrater Reliability
Mitchell, Mathew – Journal of Educational Psychology, 1993 (peer reviewed)
A hypothetical construct of interest is proposed and tested in the mathematics classroom with 350 high school students. Results indicate that it is useful to distinguish between personal and situational interest. In addition, the structure of situational interest appears multifaceted, with five subfacets found in the high school mathematics…
Descriptors: Adolescents, Construct Validity, Factor Analysis, Factor Structure
Hendrickson, Amy; Patterson, Brian; Melican, Gerald – College Board, 2008
Presented at the Annual Meeting of the National Council on Measurement in Education (NCME) in New York in March 2008. This presentation explores how different item weighting can affect the effective weights, validity coefficients, and test reliability of composite scores among test takers.
Descriptors: Multiple Choice Tests, Test Format, Test Validity, Test Reliability
Turner, Jean – Annual Review of Applied Linguistics, 1998 (peer reviewed)
This review of research on second-language oral testing outlines the nature of early research in interview-format proficiency testing, then reports on new directions in investigation of construct validity of interview-format and other oral skills tests through examination of examinee, interviewer, and rater performance. Research on empirically…
Descriptors: Construct Validity, Educational Trends, Interrater Reliability, Interviews
Casillas, Alex; Schulz, E. Matthew; Robbins, Steven B.; Santos, Paulo Jorge; Lee, Richard M. – Journal of Career Assessment, 2006
The present study uses item response theory (IRT) to establish comparability between the English and Portuguese versions of the Goal Instability Scale (GIS), a measure of generalized motivation. A total of 2,848 American and 679 Portuguese high school students were administered their respective language versions of the GIS. Results showed only…
Descriptors: Student Motivation, Cross Cultural Studies, Item Response Theory, Rating Scales
Brand, Pamela A.; Anastasio, Phyllis A. – Journal of Interpersonal Violence, 2006
The 50-item Violence-Related Attitudes and Beliefs Scale (V-RABS) includes three subscales measuring possible causes of violent behavior (environmental influences, biological influences, and mental illness) and four subscales assessing possible controls of violent behavior (death penalty, punishment, prevention, and catharsis). Each subscale…
Descriptors: Measures (Individuals), Violence, Crime, Punishment
DeVaney, Thomas A.; Franks, Melvin E. – 1995
If teachers are teaching a set of standard content and assessments are made consistently, the relationship between these various assessments and the curriculum should be of interest. The purpose of this study was to explore the consistency of the mathematical measures of three prominent forms of mathematics assessment: standardized tests,…
Descriptors: Black Students, Construct Validity, Educational Assessment, Elementary Education
Shavelson, Richard J.; And Others – 1993
One potential approach to the authentic assessment of what students know and can do in science is concept mapping. A concept map is a graph consisting of nodes representing concepts and labeled lines denoting the relation between a pair of nodes (concepts). The external concept map constructed by the student is interpreted as representing…
Descriptors: Cognitive Structures, Concept Mapping, Construct Validity, Educational Assessment
Guerrero, Michael D. – 1994
A study evaluated the overall evaluative validity of the Four Skills Exam, a Spanish language proficiency test designed to ensure that bilingual education teachers in New Mexico can meet Spanish language demands in the bilingual education classroom. The test's construct validity was limited for several reasons. In designing a test capturing…
Descriptors: Bilingual Education, Comparative Analysis, Construct Validity, Elementary Secondary Education
Polat, Filiz – American Annals of the Deaf, 2006
The article presents results of the standardization of the Meadow-Kendall Social-Emotional Assessment Inventory for Deaf and Hearing-Impaired Students (Meadow, 1983), school-age version, for use in Turkey. The SEAI is a 59-item measure for assessing socioemotional adjustment of school-age deaf and hearing-impaired students. A sample of 1,097 deaf…
Descriptors: Turkish, Deafness, Foreign Countries, Emotional Adjustment
Veccia, Ellen M.; Schroeder, David H. – 1990
As a measure of musical aptitude, a new 90-item Pitch Discrimination Test was developed, and its internal structure was examined. Each of the three sections of the test measures an individual's aptitude for pitch discrimination in a different frequency range using square-wave tones generated by a personal computer. A total of 1,303 examinees,…
Descriptors: Ability Identification, Adults, Aptitude Tests, Auditory Discrimination

