Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 3 |
Descriptor
Source
Applied Psychological… | 1 |
Eurasian Journal of… | 1 |
Online Submission | 1 |
Pearson | 1 |
Psychological Assessment | 1 |
Author
Ackerman, Terry A. | 2 |
Anderson, A. E. | 1 |
Applebaum, Wayne R. | 1 |
Axelrod, Bradley N. | 1 |
Baghi, Heibatollah | 1 |
Battaile, Richard | 1 |
Beck, Michael | 1 |
Bell, Anita I. | 1 |
Binici, Salih | 1 |
Brennan, Robert L. | 1 |
Bump, Wren M. | 1 |
More ▼ |
Publication Type
Speeches/Meeting Papers | 38 |
Reports - Research | 20 |
Reports - Evaluative | 12 |
Journal Articles | 3 |
Reports - Descriptive | 3 |
Numerical/Quantitative Data | 2 |
Education Level
Elementary Secondary Education | 2 |
Higher Education | 1 |
Postsecondary Education | 1 |
Audience
Researchers | 5 |
Location
Arizona | 1 |
Delaware | 1 |
India | 1 |
Texas (Dallas) | 1 |
Turkey | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Öztürk Gübes, Nese – Eurasian Journal of Educational Research, 2021
Purpose: In grading, one of the most common errors is made in combining two or more different test scores. This study aimed to investigate the agreement of grades calculated by weighting raw scores and standard scores. Research Methods: In this simulation study, data were simulated for midterm and final measurements. Nine conditions [3 (class…
Descriptors: Grading, Raw Scores, Weighted Scores, Norm Referenced Tests
Powers, Sonya; Turhan, Ahmet; Binici, Salih – Pearson, 2012
The population sensitivity of vertical scaling results was evaluated for a state reading assessment spanning grades 3-10 and a state mathematics test spanning grades 3-8. Subpopulations considered included males and females. The 3-parameter logistic model was used to calibrate math and reading items and a common item design was used to construct…
Descriptors: Scaling, Equated Scores, Standardized Tests, Reading Tests
Gafoor, K. Abdul – Online Submission, 2011
This study explores interest in physics, chemistry and biology among school students in Kerala. It used a sample of 3236 (1659 boys, 1577 girls) students studying in upper primary to higher secondary classes. Three separate versions of scale of interest in science were used to quantify interest in science of upper primary, secondary and higher…
Descriptors: Physics, Females, Chemistry, Student Attitudes

Brennan, Robert L. – Applied Psychological Measurement, 1998
Provides a comprehensive and integrated treatment of both conditional absolute standard errors of measurement (SEM) and conditional relative SEMs from the perspective of generalizability theory. Illustrates the approach with examples from commercial standardized tests. Examples support the conclusion that both types of conditional SEMs tend to be…
Descriptors: Error of Measurement, Generalizability Theory, Raw Scores, Standardized Tests
Hoffman, R. Gene; Wise, Lauress L. – 2000
Classical test theory is based on the concept of a true score for each examinee, defined as the expected or average score across an infinite number of repeated parallel tests. In most cases, there is only a score from a single administration of the test in question. The difference between this single observed score and the underlying true score is…
Descriptors: Achievement, Classification, Observation, Probability
Buras, Avery – 1996
The logic and uses of test equating are discussed, including three methods of test equating. The focus is on the conceptual underpinnings of each test equating method, rather than on the mathematics of the procedures. Additional consideration is given to the assumptions of each method and its respective strengths and weaknesses. A commonly…
Descriptors: Equated Scores, Item Response Theory, Models, Raw Scores
Frisbie, David A. – 1981
The relative difficulty ratio (RDR) is used as a method of representing test difficulty. The RDR is the ratio of a test mean to the ideal mean, the point midway between the perfect score and the mean chance score for the test. The RDR tranformation is a linear scale conversion method but not a linear equating method in the classical sense. The…
Descriptors: Comparative Testing, Difficulty Level, Evaluation Methods, Raw Scores
Jaeger, Richard M. – 1974
Two relatively new tools for analysis of data compiled in evaluation studies are presented. The National Test-Equating Study in Reading, known as the Anchor Test Study, produced tables of score-correspondence between the eight reading comprehension and vocabulary tests most widely used in the United States. Two types of tables from this report…
Descriptors: Data Analysis, Equated Scores, Evaluation Methods, Raw Scores
Schumacker, Randall E. – 1998
In comparing measurement theories, it is evident that the awareness of the concept of measurement error during the time of Galileo has lead to the formulation of observed scores comprising a true score and error (classical theory), universe score and various random error components (generalizability theory), or individual latent ability and error…
Descriptors: Comparative Analysis, Computer Software, Error of Measurement, Generalizability Theory
Bump, Wren M. – 1991
The normal curve has long been important in statistics. Most interval variables yield normal or quasi-normal distributions when data are collected from large samples, and the normal "Z" distribution is also used as a test statistic (e.g., to test differences between two means when sample size is large, since "t" approaches…
Descriptors: Data Collection, Equations (Mathematics), Functions (Mathematics), Graphs
Phillips, S. E.; Anderson, A. E. – 1983
The LOGTRUE program can be used to obtain a scale of equated raw scores for two tests with parameter estimates on a common item response theory scale. The program derives its name from the method of logistic true score equating described by Lord (1980). The method can be applied to two tests with overlapping items administered to different groups…
Descriptors: Computer Programs, Equated Scores, Group Testing, Latent Trait Theory

Woodard, John L.; Axelrod, Bradley N. – Psychological Assessment, 1995
Using 308 patients referred for neuropsychological evaluation, 2 regression equations were developed to predict weighted raw score sums for General Memory and Delayed Recall using the Wechsler Memory Scale-Revised (WMS-R) analogs of 5 subtests from the original WMS. The equations may help reduce WMS-R administration time. (SLD)
Descriptors: Equations (Mathematics), Memory, Neuropsychology, Patients
Pommerich, Mary; And Others – 1995
The Mantel-Haenszel (MH) statistic for identifying differential item functioning (DIF) commonly conditions on the observed test score as a surrogate for conditioning on latent ability. When the comparison group distributions are not completely overlapping (i.e., are incongruent), the observed score represents different levels of latent ability…
Descriptors: Ability, Comparative Analysis, Difficulty Level, Item Bias
Yang, Wen-Ling – 1997
Using an anchor-item design of test equating, the effects of three equating methods (Tucker linear and two three-parameter item-response-theory-based (3PL-IRT) methods), and the content representativeness of anchor items on the accuracy of equating were examined; and an innovative way of evaluating equating accuracy appropriate for the particular…
Descriptors: Equated Scores, Item Response Theory, Raw Scores, Test Construction
Campbell, Kathleen Taylor; Tucker, Mary L. – 1992
Since canonical correlation analysis subsumes multiple regression as a special case, and since commonality analysis (a variance partitioning procedure) has proven useful in interpreting multiple regression results, the interpretation of canonical correlation results might also be enhanced by the use of commonality analysis. In this paper, a…
Descriptors: Assistant Principals, Correlation, Elementary Secondary Education, Multivariate Analysis