Showing 1 to 15 of 131 results
Benton, Tom; Williamson, Joanna – Research Matters, 2022
Equating methods are designed to adjust between alternate versions of assessments targeting the same content at the same level, with the aim that scores from the different versions can be used interchangeably. The statistical processes used in equating have, however, been extended to statistically "link" assessments that differ, such as…
Descriptors: Statistical Analysis, Equated Scores, Definitions, Alternative Assessment
Peer reviewed
LaFlair, Geoffrey T.; Langenfeld, Thomas; Baig, Basim; Horie, André Kenji; Attali, Yigal; von Davier, Alina A. – Journal of Computer Assisted Learning, 2022
Background: Digital-first assessments leverage the affordances of technology in all elements of the assessment process--from design and development to score reporting and evaluation--to create test taker-centric assessments. Objectives: The goal of this paper is to describe the engineering, machine learning, and psychometric processes and…
Descriptors: Computer Assisted Testing, Affordances, Scoring, Engineering
Peer reviewed
Meyer, Emily M.; Reynolds, Matthew R. – Journal of Psychoeducational Assessment, 2018
The purpose of this study was to use multidimensional scaling (MDS) to investigate relations among scores from the standardization sample of the Wechsler Intelligence Scale for Children--Fifth edition (WISC-V; Wechsler, 2014). Nonmetric two-dimensional MDS maps were selected for interpretation. The most cognitively complex subtests and indexes…
Descriptors: Children, Intelligence Tests, Scaling, Factor Analysis
Peer reviewed
Mori, Kazuo; Uchida, Akitoshi – Research in Education, 2012
Longitudinal change in the average Z scores for four groups of pupils sorted by quartiles was examined for its stability over three years. The data, collected from 1998 to 2009, was obtained from nine cohorts of Japanese junior high school pupils totaling 1,962 subjects. It showed illusionary declines among the mid-range pupils but improvements…
Descriptors: Foreign Countries, Junior High School Students, Cohort Analysis, Evaluation Problems
Peer reviewed
Miley, Alan D. – Educational and Psychological Measurement, 1980
The tendency to extreme scores (TES) can affect sensitive indices, such as Cattell's coefficient of pattern similarity, so that a flat profile will, in general, be found more similar to a standard than will an extreme profile. TES is especially critical when profile matching is used in clinical diagnosis. (Author/BW)
Descriptors: Clinical Diagnosis, Profiles, Statistical Analysis, Test Interpretation
Andrich, David – 1984
Both the attenuation paradox of traditional test theory and the assumption of local independence in person-item response theory have caused problems in interpretation. This paper demonstrates that the two are related concepts, and, through this demonstration, both are clarified. It is demonstrated that the breakdown of local independence leads to…
Descriptors: Latent Trait Theory, Test Interpretation, Test Items, Test Reliability
Calhoun, William Ford – 1976
This report documents (1) the problems inherent in multiple choice testing, (2) a solution to the problems, and (3) computer programs required by the solution. Problems of multiple choice testing include scheduling inflexibility, methodological inflexibility, cheating, inefficiencies of space and student interaction time, inefficiencies of…
Descriptors: Computer Programs, Higher Education, Multiple Choice Tests, Test Construction
Peer reviewed
Creaser, James W.; Jacobs, Mitchell – Journal of Counseling Psychology, 1987
Strong-Campbell Interest Inventory answer sheets for 300 male university freshmen were scored via both the 1981 and 1985 scoring systems. Communalities of the profiles generated by the two scoring systems indicated considerable profile variance. Counselors should thoroughly understand changes made in the new instrument. (Author/NB)
Descriptors: College Freshmen, Higher Education, Interest Inventories, Males
Peer reviewed
Spencer, Bruce D. – Journal of Educational Measurement, 1983
Because test scores are ordinal, not cardinal, attributes, the average test score often is a misleading way to summarize the scores of a group of individuals. Similarly, correlation coefficients may be misleading summary measures of association between test scores. Proper, readily interpretable, summary statistics are developed from a theory of…
Descriptors: Correlation, Measurement Techniques, Scores, Statistical Analysis
Peer reviewed
Willson, Victor L.; Reynolds, Cecil R. – Educational and Psychological Measurement, 1984
Samples in research on individual and group differences may be selected based on whole scores which differ from the population mean. Children are diagnosed in clinical practice with a whole score. These procedures produce regression to the population mean which can affect accuracy and adequacy of part score interpretations. (Author/DWH)
Descriptors: Correlation, Intelligence Tests, Profiles, Scores
Peer reviewed
Murphy, R. J. L. – British Journal of Educational Psychology, 1979
Two senior GCE examiners re-marked photocopies of the same 200 GCE examination scripts, half still containing the marks and comments of the original examiners and half with these markings removed. Removing previous markings made a considerable difference to the extent of agreement between these sets of marks. (Editor/SJL)
Descriptors: Essay Tests, Examiners, Grading, Reliability
Miller, Margery Silberman – 1984
The paper provides a theoretical framework for the inclusion of a verbal intelligence test as part of the psychodiagnostic assessment battery used with deaf children. Descriptions are provided for three selected sign language varieties being used in a study designed to examine performance of 30 deaf children (9-16 years old) on signed…
Descriptors: Deafness, Elementary Secondary Education, Intelligence Tests, Sign Language
Peer reviewed
Tatsuoka, Kikumi K.; Tatsuoka, Maurice M. – Journal of Educational Statistics, 1982
Two indices for measuring the degree of conformity or consistency of an individual examinee's response pattern on a set of items are developed. The use of the indices for spotting aberrant response patterns of examinees is detailed. (Author/JKS)
Descriptors: Error of Measurement, Error Patterns, Goodness of Fit, Item Analysis
Hoover, Randy L.; Kadunc, Nancy – 1983
The purpose of this paper is to examine the nature of discrepancy score phenomena of the Myers-Briggs Type Indicator (MBTI), as related to internal consistency and construct validity of the instrument. Data were collected from 140 university research managers. The data suggest internal consistency problems: only 37.3 percent of the subjects…
Descriptors: Adults, Personality Measures, Personality Traits, Sampling
Peer reviewed
Chase, Christopher H.; Sattler, Jerome M. – School Psychology Review, 1980
Sattler's standard deviation technique for interpreting strengths and weaknesses on the Stanford-Binet Intelligence Scale has been simplified by Kaufman and Waterstreet in the form of an easy-to-use table. A refinement of their table is presented, with an example to demonstrate its use. (Author/CTM)
Descriptors: Chronological Age, Error of Measurement, Intelligence Tests, Mental Age