Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 7 |
Descriptor
Comparative Analysis | 19 |
Psychometrics | 19 |
Testing Problems | 19 |
Test Interpretation | 9 |
Measurement Techniques | 7 |
Evaluation Methods | 6 |
Test Use | 6 |
Definitions | 5 |
Educational Assessment | 5 |
Educational Testing | 5 |
Equated Scores | 5 |
More ▼ |
Source
Measurement:… | 5 |
Diagnostique | 1 |
ETS Research Report Series | 1 |
Educational and Psychological… | 1 |
International Journal of… | 1 |
Journal of Clinical Psychology | 1 |
Language Testing | 1 |
Psychology in the Schools | 1 |
Author
Baird, Jo-Anne | 1 |
Bringmann, Wolfgang, G. | 1 |
Caldwell, Mary Lou | 1 |
Caplan, Marc | 1 |
Choppin, Bruce | 1 |
Christian, James K. | 1 |
Cresswell, Mike | 1 |
Foster, Jeff L. | 1 |
Hambleton, Ronald K. | 1 |
Johnson, Nancy E. | 1 |
Kim, Sooyeon | 1 |
More ▼ |
Publication Type
Journal Articles | 10 |
Opinion Papers | 7 |
Reports - Research | 7 |
Speeches/Meeting Papers | 3 |
Information Analyses | 2 |
Reports - Descriptive | 2 |
Reports - Evaluative | 1 |
Education Level
Elementary Secondary Education | 5 |
Audience
Researchers | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Advanced Placement… | 2 |
SAT (College Admission Test) | 2 |
Wechsler Intelligence Scale… | 2 |
What Works Clearinghouse Rating
Kim, Sooyeon; Walker, Michael – ETS Research Report Series, 2021
In this investigation, we used real data to assess potential differential effects associated with taking a test in a test center (TC) versus testing at home using remote proctoring (RP). We used a pseudo-equivalent groups (PEG) approach to examine group equivalence at the item level and the total score level. If our assumption holds that the PEG…
Descriptors: Testing, Distance Education, Comparative Analysis, Test Items
Cresswell, Mike – Measurement: Interdisciplinary Research and Perspectives, 2010
Paul Newton (2010), with his characteristic concern about theory, has set out two different ways of thinking about the basis upon which equivalences of one sort or another are established between test score scales. His reason for doing this is a desire to establish "the defensibility of linkages lower on the continuum than concordance."…
Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis
Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2010
This article presents the author's rejoinder to thinking about linking from issue 8(1). Particularly within the more embracing linking frameworks, e.g., Holland & Dorans (2006) and Holland (2007), there appears to be a major disjunction between (1) classification discourse: the supposed basis for classification, that is, the underlying theory…
Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis
Walker, Michael E. – Measurement: Interdisciplinary Research and Perspectives, 2010
"Linking" is a term given to a general class of procedures by which one represents scores X on one test or measure in terms of scores Y on another test or measure. A recent taxonomy by Holland and Dorans (2006; Holland, 2007) organizes the various types of links into three broad categories: prediction, scale aligning, and equating. In…
Descriptors: Foreign Countries, Test Construction, Test Validity, Measurement Techniques
Baird, Jo-Anne – Measurement: Interdisciplinary Research and Perspectives, 2010
Newton's article (2010) makes three main contributions to the literature. First, it is transatlantic, bringing together literatures that have been dealing with similar problems, using sometimes different methods and certainly with distinctive educational, cultural perspectives. He points out that neither of these literatures has all of the…
Descriptors: Foreign Countries, Predictive Validity, Standards, Ethics
von Davier, Alina A. – Measurement: Interdisciplinary Research and Perspectives, 2010
The article "Thinking About Linking" by Newton (2010) presents a novel philosophical perspective on the way that educational assessments should be linked. Newton starts by describing the linking framework as it was characterized in various publications and identifies a cross-cultural dimension in the definitions and uses of test…
Descriptors: Foreign Countries, Educational Assessment, Student Evaluation, Evaluation Criteria

Shohamy, Elana; Reves, Thea – Language Testing, 1985
Surveys the development of language tests toward authenticity and discusses the advantages and disadvantages of indirect and direct (authentic) language tests. Discusses the difficulty of applying appropriate psychometric measures to tests using real-life language, and the large number of tests variables which interfere with the authenticity of…
Descriptors: Comparative Analysis, Interviews, Language Tests, Language Usage
Bringmann, Wolfgang, G.; Christian, James K. – 1979
The practice of not sharing tests results with clients may soon be in conflict with the Ethical Standards for Psychologist (sic). Studies using self-validation of feedback information to study feedback parameters have shown that the form of feedback is less important than the content. To investigate direct feedback of test results by computer, the…
Descriptors: Comparative Analysis, Computer Assisted Testing, Cost Effectiveness, Ethics

Lord, Frederic M. – Educational and Psychological Measurement, 1971
A number of empirical studies are suggested to answer certain questions in connection with flexilevel tests. (MS)
Descriptors: Comparative Analysis, Difficulty Level, Guessing (Tests), Item Analysis

Swerdlik, Mark E. – Psychology in the Schools, 1977
The paper reviews WISC/WISC-R comparison studies which have been conducted with a wide variety of samples. Caution is advised in the interpretation of a WISC/WISC-R difference, as a discrepancy of one SD may not be meaningful. (Author)
Descriptors: Comparative Analysis, Intelligence Tests, Literature Reviews, Psychological Testing
Taylor, Ronald L.; Caldwell, Mary Lou – Diagnostique, 1990
The psychometric characteristics of 12 adults with Prader-Willi syndrome (PWS) and a group without PWS but with other similar traits were compared. Results found cognitive, behavioral and educational traits often associated with PWS to be present in both groups, illustrating the importance of control/comparison groups in research establishing…
Descriptors: Adults, Clinical Diagnosis, Comparative Analysis, Handicap Identification
Schnipke, Deborah L.; Reese, Lynda M. – 1997
Two-stage and multistage test designs provide a way of roughly adapting item difficulty to test-taker ability. All test takers take a parallel stage-one test, and, based on their scores, they are routed to tests of different difficulty levels in subsequent stages. These designs provide some of the benefits of standard computerized adaptive testing…
Descriptors: Ability, Adaptive Testing, Algorithms, Comparative Analysis

Caplan, Marc; And Others – Journal of Clinical Psychology, 1982
In two trials, subjects completed the Depression Adjective Checklist as they felt, or were instructed to "fake good,""fake bad," or "fake average." Discussed findings for "fake bad" and "fake good" in terms of ability of an examiner to detect the manipulative set through grossly deviant scores.…
Descriptors: Affective Measures, Comparative Analysis, Depression (Psychology), Psychometrics
Choppin, Bruce; And Others – 1982
A detailed description of five latent structure models of achievement measurement is presented. The first project paper, by David L. McArthur, analyzes the history of mental testing to show how conventional item analysis procedures were developed, and how dissatisfaction with them has led to fragmentation. The range of distinct conceptual and…
Descriptors: Academic Achievement, Achievement Tests, Comparative Analysis, Data Analysis
Phillips, Gary W., Ed. – 1996
Recently, there has been a significant expansion in the use of performance assessment in large scale testing programs. Although there has been significant support from curriculum and policy stakeholders, the technical feasibility of large scale performance assessments has remained a question. This report is intended to contribute to the debate by…
Descriptors: Comparative Analysis, Generalizability Theory, Performance Based Assessment, Psychometrics
Previous Page | Next Page ยป
Pages: 1 | 2