Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Smith, Douglas K.; And Others – 1994
This study examined the relationship between scores on the Wechsler Intelligence Scale for Children-III (WISC-III) and the older Wechsler Intelligence Scale for Children-Revised (WISC-R). School psychologists in Wisconsin were asked to provide data on 300 special education re-evaluations completed during the 1992-93 academic year. Pearson product…
Descriptors: Disabilities, Elementary Secondary Education, Intelligence Tests, Psychometrics
Youngjohn, James R.; And Others – 1991
Test-retest reliabilities and practice effect magnitudes were considered for nine computer-simulated tasks of everyday cognition and five traditional neuropsychological tests. The nine simulated everyday memory tests were from the Memory Assessment Clinic battery as follows: (1) simple reaction time while driving; (2) divided attention (driving…
Descriptors: Adults, Comparative Testing, Computer Assisted Testing, Computer Simulation
Clonts, Jean G. – 1992
This paper presents a review of the literature on reliability in qualitative studies. Reliability is defined as the extent to which studies can be replicated, using the same methods, and getting the same results. It is the degree to which data are independent of the accidental circumstances of the research. The review includes the following three…
Descriptors: Data Collection, Estimation (Mathematics), Generalizability Theory, Literature Reviews
Shiarella, Ann Harris; McCarthy, Anne M.; Tucker, Mary L. – 1999
The multi-stage development of the Community Service Attitudes Scale (CSAS), an instrument for measuring college students' attitudes about community service, is reported. The CSAS was developed based on the helping behavior model of S. Schwartz (1977). The developed instrument was tested with two samples of 437 and 332 college students. The scales…
Descriptors: Attitude Measures, College Students, Community Services, Factor Analysis
Kreft, Ita G. G. – 1992
The analysis of small group data with hierarchical linear models is discussed, concentrating on the usefulness and reliability of such analyses using data reported by N. M. Webb (1982). Results of Webb's analyses for 96 junior high school students in small groups are compared with results obtained with random effects linear models for the analysis…
Descriptors: Groups, Junior High School Students, Junior High Schools, Regression (Statistics)
Rogers, James R.; DeShon, Richard P. – 1992
The lack of systematic psychometric information on the Suicide Opinion Questionnaire (SOQ) was addressed by investigating the factor structure and reliability of the eight-factor clinical scale model (mental illness, cry for help, right to die, religion, impulsivity, normality, aggression, and moral evil), developed for interpreting responses to…
Descriptors: Factor Structure, Higher Education, Item Analysis, Models
Ferguson, Gibson; Maclean, Joan – Edinburgh Working Papers in Linguistics, 1991
This study is the first stage of a wider enquiry into alternative ways of assessing the readability of specialist texts. The interest in assessing these texts arose from the need to grade 60 medical journal articles for an individualized English-as-a-Foreign-Language reading scheme for doctors. The study reports on an investigation of subjective…
Descriptors: Difficulty Level, English (Second Language), Foreign Countries, Interrater Reliability
ERIC Clearinghouse on Disabilities and Gifted Education, Reston, VA. ERIC/OSEP Special Project on Interagency Information Dissemination. – 1993
This annotated bibliography on performance assessment in schools lists 16 journal articles and 2 reports which were published between 1991 and 1993. Citations address the following aspects of performance assessment: conditions for alternative assessments, validity and reliability issues, accountability issues, performance assessment in science and…
Descriptors: Accountability, Competency Based Education, Elementary Secondary Education, Performance
Lumley, Tom; McNamara, T. F. – 1993
Recent developments in multi-faceted Rasch measurement (Linacre, 1989) have made possible new kinds of investigations of aspects of performance assessments. Bias analysis, interactions between elements of any facet, can also be analyzed, which permits investigation of the way a particular aspect of the test situation may elicit a consistently…
Descriptors: English (Second Language), Experimenter Characteristics, Foreign Countries, Interrater Reliability
Skinner, Robert E. – 1994
The merits and disadvantages of standardized and informal reading tests for limited English proficient readers are discussed. A growing reliance on standardized ("formal") tests due to their ease of administration and scoring is criticized because the tests are seen as: inadequate for describing students at high and low ends of the scale; not…
Descriptors: Comparative Analysis, English (Second Language), Limited English Speaking, Reading Tests
Thompson, Lynn; And Others – 1988
A project is reported that developed a test for students in foreign language in the elementary school (FLES) programs. Relevant tests in Spanish and English as a Second Language (ESL) were reviewed in order to develop a listening and reading test that could determine achievement in a typical FLES curriculum. Pilot testing was conducted with 121…
Descriptors: FLES, Intermediate Grades, Language Tests, Second Language Instruction
Roberts, William L.; Schill, Loreen G. – 1991
The collection of observational data in natural settings and in real time requires equipment that is light and easily used, and programs that permit rapid and flexible encoding of data. This paper describes a set of four programs for collecting and analyzing continuous time sample, focal-individual data as described by J. Altmann (1974), using a…
Descriptors: Computer Software, Data Analysis, Data Collection, Educational Research
Schael, Jocelyne; Dionne, Jean-Paul – 1991
The basis of agreement or disagreement among judges/evaluators when applying a coding scheme to concurrent verbal protocols was studied. The sample included 20 university graduates, from varied backgrounds; 10 subjects had and 10 subjects did not have experience in protocol analysis. The total sample was divided into four balanced groups according…
Descriptors: Adults, College Graduates, Comparative Analysis, Encoding (Psychology)
Trevisan, Michael S. – 1991
Some of the issues regarding the estimation of reliability for performance assessments are explored, and a methodology is suggested for determination of reliability. In performance assessment, the magnitude of the reliability coefficient may be less than that obtained for a standardized test, since part of the potential use of performance…
Descriptors: Analysis of Variance, Computer Software, Educational Assessment, Estimation (Mathematics)
A Comparison of the Kansas Marital Satisfaction Scale and the Locke-Wallace Marital Adjustment Test.
White, Mark B.; And Others – 1990
Past research has suggested that the Kansas Marital Satisfaction Scale (KMS) is a brief, reliable, and valid measure of marital satisfaction. This study was conducted to: (1) examine responses on the KMS from a national sample of couples; (2) assess the construct validity of the KMS through a comparison with the Locke-Wallace Marital Adjustment…
Descriptors: Adjustment (to Environment), Construct Validity, Evaluation Methods, Marital Satisfaction


