Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 1 |
| Since 2017 (last 10 years) | 1 |
| Since 2007 (last 20 years) | 1 |
Descriptor
| Testing Problems | 27 |
| Test Reliability | 23 |
| Test Validity | 10 |
| Test Construction | 8 |
| Interrater Reliability | 7 |
| Elementary Secondary Education | 6 |
| Latent Trait Theory | 6 |
| Measurement Techniques | 6 |
| Test Items | 6 |
| Scores | 5 |
| Scoring | 5 |
| More ▼ | |
Source
| Exceptional Children | 2 |
| Journal for Research in… | 2 |
| AEDS Monitor | 1 |
| International Journal of… | 1 |
| Journal of Educational… | 1 |
| Journal of Learning… | 1 |
| Language, Speech, and Hearing… | 1 |
| Learning Disabilities… | 1 |
Author
| Andrich, David | 2 |
| Algina, James | 1 |
| Alliger, R. J. | 1 |
| Brown, Jonathan R. | 1 |
| Busch, John Christian | 1 |
| Cahan, Sorel | 1 |
| Campbell, N. Jo | 1 |
| Cohen, Nora | 1 |
| Constable, Elizabeth | 1 |
| Crowley, Mary L. | 1 |
| Danielle R. Blazek | 1 |
| More ▼ | |
Publication Type
| Speeches/Meeting Papers | 17 |
| Reports - Research | 16 |
| Journal Articles | 10 |
| Reports - Evaluative | 5 |
| Information Analyses | 4 |
| Reports - Descriptive | 2 |
Education Level
Audience
| Researchers | 27 |
| Practitioners | 4 |
| Counselors | 1 |
Location
| Israel | 1 |
| United Kingdom | 1 |
Laws, Policies, & Programs
Assessments and Surveys
| Wechsler Intelligence Scale… | 2 |
| Comprehensive Tests of Basic… | 1 |
| Computer Anxiety Scale | 1 |
| Flanders System of… | 1 |
What Works Clearinghouse Rating
Danielle R. Blazek; Jason T. Siegel – International Journal of Social Research Methodology, 2024
Social scientists have long agreed that satisficing behavior increases error and reduces the validity of survey data. There have been numerous reviews on detecting satisficing behavior, but preventing this behavior has received less attention. The current narrative review provides empirically supported guidance on preventing satisficing by…
Descriptors: Response Style (Tests), Responses, Reaction Time, Test Interpretation
Santmire, Toni E. – 1984
The purpose of this paper is to discuss ways in which developmental psychology suffers from the lack of an appropriate technology of measurement and statistical analysis. The paper begins by noting that developmental psychology is the study of change; that individuals develop through a succession of "stages" which are separated by…
Descriptors: Data Analysis, Data Collection, Developmental Psychology, Developmental Stages
Shale, Doug – 1986
This study is an attempt at a cohesive characterization of the concept of essay reliability. As such, it takes as a basic premise that previous and current practices in reporting reliability estimates for essay tests have certain shortcomings. The study provides an analysis of these shortcomings--partly to encourage a fuller understanding of the…
Descriptors: Analysis of Variance, Correlation, Error of Measurement, Essay Tests
Goldstein, Harvey; Wolf, Alison – 1986
Locally developed occupational tests were administered to 16- and 17-year-olds in a government-sponsored vocational education program in the United Kingdom over a six-month period in 1984. Job skills were tested in two occupational areas: use of a micrometer and invoice completion. Some performance tests were designed by researchers and some by…
Descriptors: Comparative Testing, Criterion Referenced Tests, Evaluation Criteria, Foreign Countries
Schempp, Paul G. – 1986
The stability of teaching behavior was examined by observing student/teacher interaction over one academic year. One teacher was studied using a time-series analysis. He had 14 years experience and taught physical education in grades K-6 in a single school. Data were collected over one academic year using the Cheffers Adaptation of Flanders…
Descriptors: Behavior Change, Case Studies, Classroom Observation Techniques, Classroom Research
Andrich, David – 1984
Both the attenuation paradox of traditional test theory and the assumption of local independence in person-item response theory have caused problems in interpretation. This paper demonstrates that the two are related concepts, and, through this demonstration, both are clarified. It is demonstrated that the breakdown of local independence leads to…
Descriptors: Latent Trait Theory, Test Interpretation, Test Items, Test Reliability
Alliger, R. J.; Harvey, A. L. – 1984
This article discusses practical and theoretical problems related to the measurement of formal operations. The first section of the article discusses problems in measuring formal operations using the clinical interview method. These problems include the lack of both a standardized interview and a uniform scoring procedure. Section two discusses…
Descriptors: Developmental Stages, Group Testing, Interviews, Objective Tests
Cahan, Sorel; Cohen, Nora – 1987
Two types of classification error are possible in competency tests: erroneous classification of an individual as a "master" of the subject (Type II error), and erroneous classification of a master as a "nonmaster" of the subject (Type I). If steps are taken to minimize Type II errors, an artificially high number of true masters…
Descriptors: Classification, Cutting Scores, Foreign Countries, Mastery Tests
Peer reviewedBrown, Jonathan R. – Language, Speech, and Hearing Services in Schools, 1989
The importance of using the standard error of measurement (SEm) in determining reliability in test scores is emphasized. The SEm is compared to the hypothetical true score for standardized tests, and procedures for calculation of the SEm are explained. (JDD)
Descriptors: Elementary Secondary Education, Error of Measurement, Scores, Standardized Tests
Peer reviewedLyon, Mark A. – Journal of Learning Disabilities, 1995
This study examined differences between Wechsler Intelligence Scale for Children-Third Edition (WISC-III) and Wechsler Intelligence Scale for Children-Revised (WISC-R) scores for 40 elementary students with learning disabilities. WISC-III Full Scale, Verbal, and Performance scores were lower than comparable WISC-R scores by one-third to one-half a…
Descriptors: Comparative Analysis, Correlation, Disability Identification, Elementary Education
Campbell, N. Jo – 1986
This study reports the preliminary results of a research project that focuses on the development of an abbreviated measure of computer anxiety, the Computer Anxiety Scale (CAS)-Short Form (SF), designed for use with upper elementary and secondary school students. The subjects involved in the study included 1075 students in grades 4 through 12,…
Descriptors: Anxiety, Comparative Testing, Computers, Intermediate Grades
Nelson, Rosemery O. – 1983
Current status and new developments in behavioral assessment for clinicians and researchers are discussed. The field of behavioral assessment has attained a recognizable identity in recent years. Behavioral assessment can be defined as the identification of meaningful response units and their controlling variables for the purposes of…
Descriptors: Behavior Modification, Behavioral Science Research, Clinical Psychology, Evaluation Criteria
Merrill, Beverly; Peterson, Sarah – 1986
When the Mesa, Arizona Public Schools initiated an ambitious writing instruction program in 1978, two assessments based on student writing samples were developed. The first is based on a ninth grade proficiency test. If the student does not pass the test, high school remediation is provided. After 1987, students must pass this test in order to…
Descriptors: Computer Assisted Testing, Elementary Secondary Education, Graduation Requirements, Holistic Evaluation
Peer reviewedFitzpatrick, Anne R. – Journal of Educational Measurement, 1984
This article reviews the Basic Achievement Skills Individual Screener (BASIS), an individually administered achievement battery that consists of skills tests in reading, mathematics, and spelling as well as an optional writing exercise. BASIS is found to be an effective and efficient means of assessing basic skills. (Author/EGS)
Descriptors: Achievement Tests, Basic Skills, Screening Tests, Test Construction
Stewart, Krista J. – 1985
The Wechsler Intelligence Scale for Children-Revised (WISC-R), one of the most commonly used tests of cognitive ability, is difficult to administer accurately. The purpose of this study was primarily to assess interrater agreement on the WISC-R Administration Observational Checklist (WAOC), a new observational instrument that can be used by an…
Descriptors: Educational Psychology, Elementary Secondary Education, Examiners, Higher Education
Previous Page | Next Page ยป
Pages: 1 | 2
Direct link
