Showing 1 to 15 of 41 results
Peer reviewed
Taskinen, Päivi H.; Steimel, Jochen; Gräfe, Linda; Engell, Sebastian; Frey, Andreas – Peabody Journal of Education, 2015
This study examined students' competencies in engineering education at the university level. First, we developed a competency model in one specific field of engineering: process dynamics and control. Then, the theoretical model was used as a frame to construct test items to measure students' competencies comprehensively. In the empirical…
Descriptors: Models, Engineering Education, Test Items, Outcome Measures
Peer reviewed
Mrazik, Martin; Janzen, Troy M.; Dombrowski, Stefan C.; Barford, Sean W.; Krawchuk, Lindsey L. – Canadian Journal of School Psychology, 2012
A total of 19 graduate students enrolled in a graduate course conducted 6 consecutive administrations of the Wechsler Intelligence Scale for Children, 4th edition (WISC-IV, Canadian version). Test protocols were examined to obtain data describing the frequency of examiner errors, including administration and scoring errors. Results identified 511…
Descriptors: Intelligence Tests, Intelligence, Statistical Analysis, Scoring
Peer reviewed
Burnett, J. Dale – Educational and Psychological Measurement, 1974
The general use of the Spearman-Brown formula for calculating the reliability of parallel tests with different lengths is reviewed. The importance of the assumption that the component tests be parallel is noted and the property that parallel tests must be non-negatively correlated is derived. (Author)
Descriptors: Statistical Analysis, Test Reliability, Testing Problems
Smith, Leon I.; Greenberg, Sandra – 1973
This paper discusses selected applications of new tests developed within the context of a large-scale curriculum for educable mentally retarded (EMR) children, the Social Learning Curriculum (SLC), and investigates three types of reliability that need to be demonstrated in order to provide a basis for these applications. The…
Descriptors: Curriculum Evaluation, Educational Research, Evaluation Methods, Measurement Techniques
Peer reviewed
Kaiser, Henry F. – Educational and Psychological Measurement, 1980
The use of Bayes' estimates for proportions in the Law of Comparative Judgment is suggested to avoid sample proportions of zero and one. (Author)
Descriptors: Bayesian Statistics, Comparative Analysis, Reliability, Statistical Analysis
Peer reviewed
Brennan, Robert L.; Lockwood, Robert E. – Applied Psychological Measurement, 1980
Generalizability theory is used to characterize and quantify expected variance in cutting scores and to compare the Nedelsky and Angoff procedures for establishing a cutting score. Results suggest that the restricted nature of the Nedelsky (inferred) probability scale may limit its applicability in certain contexts. (Author/BW)
Descriptors: Cutting Scores, Generalization, Statistical Analysis, Test Reliability
Peer reviewed
Willson, Victor L.; Reynolds, Cecil R. – Educational and Psychological Measurement, 1984
Samples in research on individual and group differences may be selected based on whole scores which differ from the population mean. Children are diagnosed in clinical practice with a whole score. These procedures produce regression to the population mean which can affect accuracy and adequacy of part score interpretations. (Author/DWH)
Descriptors: Correlation, Intelligence Tests, Profiles, Scores
Peer reviewed
Cureton, Edward E. – Educational and Psychological Measurement, 1971
A derivation of a formula for the stability coefficient is presented and discussed in terms of test reliability over time. (PR)
Descriptors: Error of Measurement, Raw Scores, Statistical Analysis, Test Reliability
Frary, Robert B. – 1982
Three measures of person-fit (the extent to which an examinee's response pattern on a multiple-choice test is consistent with his ability as estimated by total score) were computed for students taking classroom tests under 12 different instructors at a comprehensive university. Supplementary questions on each test inquired concerning students'…
Descriptors: Higher Education, Multiple Choice Tests, Predictive Validity, Reliability
Baumgartner, Ted A. – Res Quart AAHPER, 1969
Descriptors: Measurement, Physical Education, Physical Examinations, Physical Fitness
Braun, John R.; Asta, Patricia – Meas Evaluation Guidance, 1969
This report is based on a paper presented at the annual meeting of the Educational Research Association of New York State, Kiamesha Lake, New York, November 7, 1968.
Descriptors: Adjustment (to Environment), College Freshmen, Measurement Instruments, Personality Assessment
Peer reviewed
Weiss, David J., Ed. – Applied Psychological Measurement, 1987
Issues concerning equating test scores are discussed in an introduction, four papers, and two commentaries. Equating methods research, sampling errors, linear equating, population differences, sources of equating errors, and a circular equating paradigm are considered. (SLD)
Descriptors: Equated Scores, Latent Trait Theory, Maximum Likelihood Statistics, Statistical Analysis
Peer reviewed
Rindler, Susan Ellerin – Journal of Educational Measurement, 1979
A sample of the literature on test speededness is reviewed; methods of assessing speededness are presented and criticized; the assumptions that underlie these methods are questioned, and alternate, multiple-administration methods are suggested. The importance of the effect of time limits is discussed. (Author/CTM)
Descriptors: Literature Reviews, Measurement Techniques, Reaction Time, Statistical Analysis
Peer reviewed
Chapman, Loren; Chapman, Jean P. – American Journal of Mental Deficiency, 1975
Descriptors: Difficulty Level, Exceptional Child Research, Mental Retardation, Research Methodology
Kapes, Jerome T. – 1975
Two independent studies were conducted to investigate possible differences in General Aptitude Test Battery (GATB) aptitude M resulting from the use of different test equipment (wooden vs. plastic apparatus). As part of a ten-year longitudinal study of vocational development being conducted in the Department of Vocational Education at The…
Descriptors: Aptitude Tests, Comparative Analysis, Elementary Secondary Education, Scores