Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 2 |
Descriptor
Statistical Analysis | 36 |
Test Reliability | 36 |
Testing Problems | 36 |
Test Validity | 13 |
Elementary Secondary Education | 9 |
Scores | 9 |
Test Construction | 9 |
Test Interpretation | 7 |
Equated Scores | 6 |
Mathematical Models | 6 |
Measurement Techniques | 6 |
More ▼ |
Source
Author
Bormuth, John R. | 2 |
ANDRADE, MANUEL | 1 |
Algina, James | 1 |
Andrulis, Richard S. | 1 |
Asta, Patricia | 1 |
Barford, Sean W. | 1 |
Barker, Pierce | 1 |
Baumgartner, Ted A. | 1 |
Braun, John R. | 1 |
Brennan, Robert L. | 1 |
Budescu, David | 1 |
More ▼ |
Publication Type
Reports - Research | 19 |
Journal Articles | 7 |
Speeches/Meeting Papers | 3 |
Collected Works - Serials | 2 |
Guides - Non-Classroom | 1 |
Information Analyses | 1 |
Opinion Papers | 1 |
Reference Materials -… | 1 |
Reports - Evaluative | 1 |
Education Level
Higher Education | 2 |
Elementary Secondary Education | 1 |
Postsecondary Education | 1 |
Audience
Practitioners | 1 |
Researchers | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Taskinen, Päivi H.; Steimel, Jochen; Gräfe, Linda; Engell, Sebastian; Frey, Andreas – Peabody Journal of Education, 2015
This study examined students' competencies in engineering education at the university level. First, we developed a competency model in one specific field of engineering: process dynamics and control. Then, the theoretical model was used as a frame to construct test items to measure students' competencies comprehensively. In the empirical…
Descriptors: Models, Engineering Education, Test Items, Outcome Measures
Mrazik, Martin; Janzen, Troy M.; Dombrowski, Stefan C.; Barford, Sean W.; Krawchuk, Lindsey L. – Canadian Journal of School Psychology, 2012
A total of 19 graduate students enrolled in a graduate course conducted 6 consecutive administrations of the Wechsler Intelligence Scale for Children, 4th edition (WISC-IV, Canadian version). Test protocols were examined to obtain data describing the frequency of examiner errors, including administration and scoring errors. Results identified 511…
Descriptors: Intelligence Tests, Intelligence, Statistical Analysis, Scoring

Burnett, J. Dale – Educational and Psychological Measurement, 1974
The general use of the Spearman-Brown formula for calculating the reliability of parallel tests with different lengths is reviewed. The importance of the assumption that the component tests be parallel is noted and the property that parallel tests must be non-negatively correlated is derived. (Author)
Descriptors: Statistical Analysis, Test Reliability, Testing Problems

Brennan, Robert L.; Lockwood, Robert E. – Applied Psychological Measurement, 1980
Generalizability theory is used to characterize and quantify expected variance in cutting scores and to compare the Nedelsky and Angoff procedures for establishing a cutting score. Results suggest that the restricted nature of the Nedelsky (inferred) probability scale may limit its applicability in certain contexts. (Author/BW)
Descriptors: Cutting Scores, Generalization, Statistical Analysis, Test Reliability

Willson, Victor L.; Reynolds, Cecil R. – Educational and Psychological Measurement, 1984
Samples in research on individual and group differences may be selected based on whole scores which differ from the population mean. Children are diagnosed in clinical practice with a whole score. These procedures produce regression to the population mean which can affect accuracy and adequacy of part score interpretations. (Author/DWH)
Descriptors: Correlation, Intelligence Tests, Profiles, Scores

Cureton, Edward E. – Educational and Psychological Measurement, 1971
A derivation of a formula for the stability coefficient is presented and discussed in terms of test reliability over time. (PR)
Descriptors: Error of Measurement, Raw Scores, Statistical Analysis, Test Reliability
Baumgartner, Ted A. – Res Quart AAHPER, 1969
Descriptors: Measurement, Physical Education, Physical Examinations, Physical Fitness
Braun, John R.; Asta, Patricia – Meas Evaluation Guidance, 1969
This report is based on a paper presented at the annual meeting of the Educational Research Association of the New York State, Kiamesha Lake, New York, November 7, 1968
Descriptors: Adjustment (to Environment), College Freshmen, Measurement Instruments, Personality Assessment

Weiss, David J., Ed. – Applied Psychological Measurement, 1987
Issues concerning equating test scores are discussed in an introduction, four papers, and two commentaries. Equating methods research, sampling errors, linear equating, population differences, sources of equating errors, and a circular equating paradigm are considered. (SLD)
Descriptors: Equated Scores, Latent Trait Theory, Maximum Likelihood Statistics, Statistical Analysis

Rindler, Susan Ellerin – Journal of Educational Measurement, 1979
A sample of the literature on test speededness is reviewed; methods of assessing speededness are presented and criticized; the assumptions that underlie these methods are questioned, and alternate, multiple-administration methods are suggested. The importance of the effect of time limits is discussed. (Author/CTM)
Descriptors: Literature Reviews, Measurement Techniques, Reaction Time, Statistical Analysis

Chapman, Loren; Chapman, Jean P. – American Journal of Mental Deficiency, 1975
Descriptors: Difficulty Level, Exceptional Child Research, Mental Retardation, Research Methodology
Kapes, Jerome T. – 1975
Two independent studies were conducted to investigate possible differences in General Aptitude Test Battery (GATB) aptitude M resulting from the use of different test equipment (wooden vs. plastic apparatus.) As part of a ten-year longitudinal study of Vocational Development being conducted in the Department of Vocational Education at The…
Descriptors: Aptitude Tests, Comparative Analysis, Elementary Secondary Education, Scores

Budescu, David – Journal of Educational Measurement, 1985
An important determinant of equating process efficiency is the correlation between the anchor test and components of each form. Use of some monotonic function of this correlation as a measure of equating efficiency is suggested. A model relating anchor test length and test reliability to this measure of efficiency is presented. (Author/DWH)
Descriptors: Correlation, Equated Scores, Mathematical Models, Standardized Tests
Andrulis, Richard S.; And Others – 1974
The purpose of this investigation was to establish the effects of repeaters on test equating. Since consideration was not given to repeaters in test equating, such as in the derivation of equations by Angoff (1971), the hypothetical effect needed to be established. A case study was examined which showed results on a test as expected; overall mean…
Descriptors: Cutting Scores, Equated Scores, Recall (Psychology), Retention (Psychology)
Larsson, Bernt – Didakometry, 1974
Subjects are asked to answer six questions, partly with a frequency and partly by marking a verbally anchored scale with five categories. Some univariate and multivariate analyses are performed to elucidate the relations between variables with the two different modes of response. Although there are similarities in results for the two types of…
Descriptors: Measurement Techniques, Measures (Individuals), Rating Scales, Responses