Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 3 |
Since 2006 (last 20 years) | 11 |
Descriptor
Comparative Analysis | 19 |
Test Theory | 19 |
Test Validity | 19 |
Test Reliability | 12 |
Item Response Theory | 7 |
Test Construction | 7 |
Test Items | 6 |
Correlation | 5 |
Foreign Countries | 5 |
Higher Education | 5 |
Test Interpretation | 5 |
More ▼ |
Source
Author
Publication Type
Journal Articles | 11 |
Reports - Research | 11 |
Speeches/Meeting Papers | 4 |
Reports - Evaluative | 3 |
Opinion Papers | 2 |
Reports - Descriptive | 2 |
Collected Works - Proceedings | 1 |
Dissertations/Theses -… | 1 |
Information Analyses | 1 |
Education Level
Higher Education | 5 |
Postsecondary Education | 5 |
Elementary Education | 1 |
Elementary Secondary Education | 1 |
Audience
Location
United Kingdom (England) | 2 |
Germany | 1 |
Japan | 1 |
Netherlands | 1 |
Sweden | 1 |
United Kingdom | 1 |
United Kingdom (Northern… | 1 |
United Kingdom (Wales) | 1 |
United States | 1 |
Laws, Policies, & Programs
Assessments and Surveys
ACTFL Oral Proficiency… | 1 |
Defining Issues Test | 1 |
What Works Clearinghouse Rating
Alqarni, Abdulelah Mohammed – Journal on Educational Psychology, 2019
This study compares the psychometric properties of reliability in Classical Test Theory (CTT), item information in Item Response Theory (IRT), and validation from the perspective of modern validity theory for the purpose of bringing attention to potential issues that might exist when testing organizations use both test theories in the same testing…
Descriptors: Test Theory, Item Response Theory, Test Construction, Scoring
Yoshioka, Sérgio R. I.; Ishitani, Lucila – Informatics in Education, 2018
Computerized Adaptive Testing (CAT) is now widely used. However, inserting new items into the question bank of a CAT requires a great effort that makes impractical the wide application of CAT in classroom teaching. One solution would be to use the tacit knowledge of the teachers or experts for a pre-classification and calibrate during the…
Descriptors: Student Motivation, Adaptive Testing, Computer Assisted Testing, Item Response Theory
Longabach, Tanya; Peyton, Vicki – Language Testing, 2018
K-12 English language proficiency tests that assess multiple content domains (e.g., listening, speaking, reading, writing) often have subsections based on these content domains; scores assigned to these subsections are commonly known as subscores. Testing programs face increasing customer demands for the reporting of subscores in addition to the…
Descriptors: Comparative Analysis, Test Reliability, Second Language Learning, Language Proficiency
Ishimoto, Michi; Thornton, Ronald K.; Sokoloff, David R. – Physical Review Special Topics - Physics Education Research, 2014
This study assesses the Japanese translation of the Force and Motion Conceptual Evaluation (FMCE). Researchers are often interested in comparing the conceptual ideas of students with different cultural backgrounds. The FMCE has been useful in identifying the concepts of English-speaking students from different backgrounds. To identify effectively…
Descriptors: Test Validity, Physics, Motion, Scientific Concepts
Mitchell, Alison M.; Truckenmiller, Adrea; Petscher, Yaacov – Communique, 2015
As part of the Race to the Top initiative, the United States Department of Education made nearly 1 billion dollars available in State Educational Technology grants with the goal of ramping up school technology. One result of this effort is that states, districts, and schools across the country are using computerized assessments to measure their…
Descriptors: Computer Assisted Testing, Educational Technology, Testing, Efficiency
Sussman, Joshua; Beaujean, A. Alexander; Worrell, Frank C.; Watson, Stevie – Measurement and Evaluation in Counseling and Development, 2013
Item response models (IRMs) were used to analyze Cross Racial Identity Scale (CRIS) scores. Rasch analysis scores were compared with classical test theory (CTT) scores. The partial credit model demonstrated a high goodness of fit and correlations between Rasch and CTT scores ranged from 0.91 to 0.99. CRIS scores are supported by both methods.…
Descriptors: Item Response Theory, Test Theory, Measures (Individuals), Racial Identification
Yorke, Mantz; Orr, Susan; Blair, Bernadette – Studies in Higher Education, 2014
There has long been the suspicion amongst staff in Art & Design that the ratings given to their subject disciplines in the UK's National Student Survey are adversely affected by a combination of circumstances--a "perfect storm". The "perfect storm" proposition is tested by comparing ratings for Art & Design with those…
Descriptors: Student Surveys, National Surveys, Art Education, Design
Wetzel, Eunike; Hell, Benedikt; Passler, Katja – Journal of Career Assessment, 2012
Three test construction strategies are described and illustrated in the development of the Verb Interest Test (VIT), an inventory that assesses vocational interests using verbs. Verbs might be a promising alternative to the descriptions of occupational activities used in most vocational interest inventories because they are context-independent,…
Descriptors: Test Construction, Culture Fair Tests, Vocational Interests, Interest Inventories
Baird, Jo-Anne – Measurement: Interdisciplinary Research and Perspectives, 2010
Newton's article (2010) makes three main contributions to the literature. First, it is transatlantic, bringing together literatures that have been dealing with similar problems, using sometimes different methods and certainly with distinctive educational, cultural perspectives. He points out that neither of these literatures has all of the…
Descriptors: Foreign Countries, Predictive Validity, Standards, Ethics
Herman, Geoffrey Lindsay – ProQuest LLC, 2011
Instructors in electrical and computer engineering and in computer science have developed innovative methods to teach digital logic circuits. These methods attempt to increase student learning, satisfaction, and retention. Although there are readily accessible and accepted means for measuring satisfaction and retention, there are no widely…
Descriptors: Grounded Theory, Delphi Technique, Concept Formation, Misconceptions
Xu, Yuejin; Iran-Nejad, Asghar; Thoma, Stephen J. – Journal of Interactive Online Learning, 2007
The purpose of the study was to determine comparability of an online version to the original paper-pencil version of Defining Issues Test 2 (DIT2). This study employed methods from both Classical Test Theory (CTT) and Item Response Theory (IRT). Findings from CTT analyses supported the reliability and discriminant validity of both versions.…
Descriptors: Computer Assisted Testing, Test Format, Comparative Analysis, Test Theory
Goulden, Nancy Rost – 1989
Since speech communication evaluators are beginning to adapt the analytic and holistic instruments and methods used for rating written products to oral products and performance, this research review investigated: (1) what the labels "analytic" and "holistic" mean; (2) the theoretical bases of the two scoring approaches; and (3)…
Descriptors: Comparative Analysis, Higher Education, Holistic Evaluation, Rating Scales
Gamache, LeAnn M. – 1983
Scales constructed under procedures and criteria outlined by the various traditional and latent trait methods were examined as to whether they varied in characteristics related to scale quality. Scales were constructed from a common pool of items analyzed in full form according to Likert and a one-parameter Rasch model for non-dichotomous data.…
Descriptors: Comparative Analysis, Correlation, Higher Education, Item Analysis
Vazulik, Johannes; Brown, Cheri – 1989
A study supplementing earlier research by Lalande and Schweckendiek investigated comparisons and correlations obtained from testing a group of 17 university students of German using both the American Council on the Teaching of Foreign Languages (ACTFL) Oral Proficiency Interview (OPI) and the most recent revision of the examination for the…
Descriptors: Comparative Analysis, Comparative Testing, Correlation, German
McDaniel, Barbara A. – 1985
A study was conducted to determine whether evaluators of large scale essay tests respond the same way toward essays written by English as a second language (ESL) and non-ESL students. The data examined came from the English Placement Test (EPT) administered in the province of British Columbia, Canada, in March 1979. The test was used to identify…
Descriptors: Chinese, Comparative Analysis, English (Second Language), Higher Education
Previous Page | Next Page »
Pages: 1 | 2