Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 16 |
Descriptor
Comparative Analysis | 26 |
Psychometrics | 26 |
Test Interpretation | 26 |
Evaluation Methods | 15 |
Measurement Techniques | 12 |
Foreign Countries | 9 |
Test Use | 9 |
Testing Problems | 9 |
Educational Assessment | 8 |
Educational Testing | 8 |
Equated Scores | 7 |
More ▼ |
Source
Author
Newton, Paul E. | 2 |
Appenzellar, Anne B. | 1 |
Baird, Jo-Anne | 1 |
Bank, Jurgen | 1 |
Bartram, Dave | 1 |
Besel, Ronald | 1 |
Bramley, Tom | 1 |
Bringmann, Wolfgang, G. | 1 |
Bulut, Okan | 1 |
Caplan, Marc | 1 |
Christian, James K. | 1 |
More ▼ |
Publication Type
Journal Articles | 18 |
Reports - Research | 11 |
Opinion Papers | 5 |
Reports - Descriptive | 4 |
Reports - Evaluative | 4 |
Speeches/Meeting Papers | 3 |
Numerical/Quantitative Data | 1 |
Education Level
Elementary Secondary Education | 8 |
Elementary Education | 2 |
Grade 6 | 1 |
Grade 7 | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Secondary Education | 1 |
Audience
Location
United Kingdom | 5 |
United States | 4 |
Australia | 3 |
United Kingdom (England) | 3 |
United Kingdom (Wales) | 2 |
Canada | 1 |
South Africa | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Cormier, Damien C.; Bulut, Okan; McGrew, Kevin S.; Kennedy, Kathleen – Journal of Intelligence, 2022
Consideration of the influence of English language skills during testing is an understandable requirement for fair and valid cognitive test interpretation. Several professional standards and expert recommendations exist to guide psychologists as they attempt to engage in best practices when assessing English learners (ELs). Nonetheless, relatively…
Descriptors: Language Tests, English (Second Language), Second Language Learning, Culture Fair Tests
Plucker, Jonathan A.; Qian, Meihua; Schmalensee, Stephanie L. – Creativity Research Journal, 2014
In recent years, the social sciences have seen a resurgence in the study of divergent thinking (DT) measures. However, many of these recent advances have focused on abstract, decontextualized DT tasks (e.g., list as many things as you can think of that have wheels). This study provides a new perspective by exploring the reliability and validity…
Descriptors: Creative Thinking, Creativity Tests, Scoring Formulas, Evaluation Methods
Kolen, Michael J.; Lee, Won-Chan – Educational Measurement: Issues and Practice, 2011
This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…
Descriptors: Test Use, Test Format, Error of Measurement, Raw Scores
Cresswell, Mike – Measurement: Interdisciplinary Research and Perspectives, 2010
Paul Newton (2010), with his characteristic concern about theory, has set out two different ways of thinking about the basis upon which equivalences of one sort or another are established between test score scales. His reason for doing this is a desire to establish "the defensibility of linkages lower on the continuum than concordance."…
Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis
Bramley, Tom; Gill, Tim – Research Papers in Education, 2010
The rank-ordering method for standard maintaining was designed for the purpose of mapping a known cut-score (e.g. a grade boundary mark) on one test to an equivalent point on the test score scale of another test, using holistic expert judgements about the quality of exemplars of examinees' work (scripts). It is a novel application of an old…
Descriptors: Scores, Psychometrics, Measurement Techniques, Foreign Countries
Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2010
This article presents the author's rejoinder to thinking about linking from issue 8(1). Particularly within the more embracing linking frameworks, e.g., Holland & Dorans (2006) and Holland (2007), there appears to be a major disjunction between (1) classification discourse: the supposed basis for classification, that is, the underlying theory…
Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis
Walker, Michael E. – Measurement: Interdisciplinary Research and Perspectives, 2010
"Linking" is a term given to a general class of procedures by which one represents scores X on one test or measure in terms of scores Y on another test or measure. A recent taxonomy by Holland and Dorans (2006; Holland, 2007) organizes the various types of links into three broad categories: prediction, scale aligning, and equating. In…
Descriptors: Foreign Countries, Test Construction, Test Validity, Measurement Techniques
Newton, Paul E. – Research Papers in Education, 2010
Robert Coe has claimed that three broad conceptions of comparability can be identified from the literature: performance, statistical and conventional. Each of these he rejected, in favour of a single, integrated conception which relies upon the notion of a "linking construct" and which he termed "construct comparability".…
Descriptors: Psychometrics, Measurement Techniques, Foreign Countries, Tests
Baird, Jo-Anne – Measurement: Interdisciplinary Research and Perspectives, 2010
Newton's article (2010) makes three main contributions to the literature. First, it is transatlantic, bringing together literatures that have been dealing with similar problems, using sometimes different methods and certainly with distinctive educational, cultural perspectives. He points out that neither of these literatures has all of the…
Descriptors: Foreign Countries, Predictive Validity, Standards, Ethics
von Davier, Alina A. – Measurement: Interdisciplinary Research and Perspectives, 2010
The article "Thinking About Linking" by Newton (2010) presents a novel philosophical perspective on the way that educational assessments should be linked. Newton starts by describing the linking framework as it was characterized in various publications and identifies a cross-cultural dimension in the definitions and uses of test…
Descriptors: Foreign Countries, Educational Assessment, Student Evaluation, Evaluation Criteria
Bartram, Dave – International Journal of Testing, 2008
The article discusses issues relating to the international use of personality inventories, especially those in which organizations make comparisons between people from differing cultures or countries or those with different languages. The focus is on the issue of norming and the use of national versus multinational norms. It is noted that…
Descriptors: Guidelines, Norms, Cultural Differences, Global Approach
Bringmann, Wolfgang, G.; Christian, James K. – 1979
The practice of not sharing tests results with clients may soon be in conflict with the Ethical Standards for Psychologist (sic). Studies using self-validation of feedback information to study feedback parameters have shown that the form of feedback is less important than the content. To investigate direct feedback of test results by computer, the…
Descriptors: Comparative Analysis, Computer Assisted Testing, Cost Effectiveness, Ethics

McCusker, Paul J. – Psychological Assessment, 1994
Three short forms of the Wechsler Adult Intelligence Scale-Revised (WAIS-R), developed in 1991, were cross-validated on 207 male and 133 female adolescent psychiatric inpatients and outpatients. Results show psychometric properties for the short forms that are comparable to those of the WAIS-R standardization sample. (SLD)
Descriptors: Adolescents, Clinical Diagnosis, Comparative Analysis, Intelligence Tests

Swerdlik, Mark E. – Psychology in the Schools, 1977
The paper reviews WISC/WISC-R comparison studies which have been conducted with a wide variety of samples. Caution is advised in the interpretation of a WISC/WISC-R difference, as a discrepancy of one SD may not be meaningful. (Author)
Descriptors: Comparative Analysis, Intelligence Tests, Literature Reviews, Psychological Testing

Caplan, Marc; And Others – Journal of Clinical Psychology, 1982
In two trials, subjects completed the Depression Adjective Checklist as they felt, or were instructed to "fake good,""fake bad," or "fake average." Discussed findings for "fake bad" and "fake good" in terms of ability of an examiner to detect the manipulative set through grossly deviant scores.…
Descriptors: Affective Measures, Comparative Analysis, Depression (Psychology), Psychometrics
Previous Page | Next Page ยป
Pages: 1 | 2