ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	11

Descriptor

Comparative Analysis	19
Test Theory	19
Test Validity	19
Test Reliability	12
Item Response Theory	7
Test Construction	7
Test Items	6
Correlation	5
Foreign Countries	5
Higher Education	5
Test Interpretation	5
Item Analysis	4
Scores	4
Statistical Analysis	4
College Students	3
Computer Assisted Testing	3
Criterion Referenced Tests	3
Difficulty Level	3
Evaluation Methods	3
Item Banks	3
Measurement Techniques	3
Measures (Individuals)	3
Norm Referenced Tests	3
Scoring	3
Test Bias	3
More ▼

Source

Communique	1
Informatics in Education	1
Journal of Career Assessment	1
Journal of Interactive Online…	1
Journal on Educational…	1
Language Testing	1
Measurement and Evaluation in…	1
Measurement:…	1
Performance and Instruction	1
Physical Review Special…	1
ProQuest LLC	1
Studies in Higher Education	1
More ▼

Publication Type

Journal Articles	11
Reports - Research	11
Speeches/Meeting Papers	4
Reports - Evaluative	3
Opinion Papers	2
Reports - Descriptive	2
Collected Works - Proceedings	1
Dissertations/Theses -…	1
Information Analyses	1

Education Level

Higher Education	5
Postsecondary Education	5
Elementary Education	1
Elementary Secondary Education	1

Audience

Location

United Kingdom (England)	2
Germany	1
Japan	1
Netherlands	1
Sweden	1
United Kingdom	1
United Kingdom (Northern…	1
United Kingdom (Wales)	1
United States	1

Laws, Policies, & Programs

Assessments and Surveys

ACTFL Oral Proficiency…	1
Defining Issues Test	1

What Works Clearinghouse Rating

Showing 1 to 15 of 19 results Save | Export

A Design for Comparing CTT and IRT in Test Assembly, Scoring and Argumentation: Differences among Reliability, Information and Validation

Peer reviewed

Direct link

Alqarni, Abdulelah Mohammed – Journal on Educational Psychology, 2019

This study compares the psychometric properties of reliability in Classical Test Theory (CTT), item information in Item Response Theory (IRT), and validation from the perspective of modern validity theory for the purpose of bringing attention to potential issues that might exist when testing organizations use both test theories in the same testing…

Descriptors: Test Theory, Item Response Theory, Test Construction, Scoring

An Adaptive Test Analysis Based on Students' Motivation

Peer reviewed
PDF on ERIC

Download full text

Yoshioka, Sérgio R. I.; Ishitani, Lucila – Informatics in Education, 2018

Computerized Adaptive Testing (CAT) is now widely used. However, inserting new items into the question bank of a CAT requires a great effort that makes impractical the wide application of CAT in classroom teaching. One solution would be to use the tacit knowledge of the teachers or experts for a pre-classification and calibrate during the…

Descriptors: Student Motivation, Adaptive Testing, Computer Assisted Testing, Item Response Theory

A Comparison of Reliability and Precision of Subscore Reporting Methods for a State English Language Proficiency Assessment

Peer reviewed

Direct link

Longabach, Tanya; Peyton, Vicki – Language Testing, 2018

K-12 English language proficiency tests that assess multiple content domains (e.g., listening, speaking, reading, writing) often have subsections based on these content domains; scores assigned to these subsections are commonly known as subscores. Testing programs face increasing customer demands for the reporting of subscores in addition to the…

Descriptors: Comparative Analysis, Test Reliability, Second Language Learning, Language Proficiency

Validating the Japanese Translation of the Force and Motion Conceptual Evaluation and Comparing Performance Levels of American and Japanese Students

Peer reviewed

Direct link

Ishimoto, Michi; Thornton, Ronald K.; Sokoloff, David R. – Physical Review Special Topics - Physics Education Research, 2014

This study assesses the Japanese translation of the Force and Motion Conceptual Evaluation (FMCE). Researchers are often interested in comparing the conceptual ideas of students with different cultural backgrounds. The FMCE has been useful in identifying the concepts of English-speaking students from different backgrounds. To identify effectively…

Descriptors: Test Validity, Physics, Motion, Scientific Concepts

Computer-Adaptive Assessments: Fundamentals and Considerations

Direct link

Mitchell, Alison M.; Truckenmiller, Adrea; Petscher, Yaacov – Communique, 2015

As part of the Race to the Top initiative, the United States Department of Education made nearly 1 billion dollars available in State Educational Technology grants with the goal of ramping up school technology. One result of this effort is that states, districts, and schools across the country are using computerized assessments to measure their…

Descriptors: Computer Assisted Testing, Educational Technology, Testing, Efficiency

An Analysis of Cross Racial Identity Scale Scores Using Classical Test Theory and Rasch Item Response Models

Peer reviewed

Direct link

Sussman, Joshua; Beaujean, A. Alexander; Worrell, Frank C.; Watson, Stevie – Measurement and Evaluation in Counseling and Development, 2013

Item response models (IRMs) were used to analyze Cross Racial Identity Scale (CRIS) scores. Rasch analysis scores were compared with classical test theory (CTT) scores. The partial credit model demonstrated a high goodness of fit and correlations between Rasch and CTT scores ranged from 0.91 to 0.99. CRIS scores are supported by both methods.…

Descriptors: Item Response Theory, Test Theory, Measures (Individuals), Racial Identification

Hit by a Perfect Storm? Art & Design in the National Student Survey

Peer reviewed

Direct link

Yorke, Mantz; Orr, Susan; Blair, Bernadette – Studies in Higher Education, 2014

There has long been the suspicion amongst staff in Art & Design that the ratings given to their subject disciplines in the UK's National Student Survey are adversely affected by a combination of circumstances--a "perfect storm". The "perfect storm" proposition is tested by comparing ratings for Art & Design with those…

Descriptors: Student Surveys, National Surveys, Art Education, Design

Comparison of Different Test Construction Strategies in the Development of a Gender Fair Interest Inventory Using Verbs

Peer reviewed

Direct link

Wetzel, Eunike; Hell, Benedikt; Passler, Katja – Journal of Career Assessment, 2012

Three test construction strategies are described and illustrated in the development of the Verb Interest Test (VIT), an inventory that assesses vocational interests using verbs. Verbs might be a promising alternative to the descriptions of occupational activities used in most vocational interest inventories because they are context-independent,…

Descriptors: Test Construction, Culture Fair Tests, Vocational Interests, Interest Inventories

What Constitutes Legitimate Causal Linking?

Peer reviewed

Direct link

Baird, Jo-Anne – Measurement: Interdisciplinary Research and Perspectives, 2010

Newton's article (2010) makes three main contributions to the literature. First, it is transatlantic, bringing together literatures that have been dealing with similar problems, using sometimes different methods and certainly with distinctive educational, cultural perspectives. He points out that neither of these literatures has all of the…

Descriptors: Foreign Countries, Predictive Validity, Standards, Ethics

The Development of a Digital Logic Concept Inventory

Direct link

Herman, Geoffrey Lindsay – ProQuest LLC, 2011

Instructors in electrical and computer engineering and in computer science have developed innovative methods to teach digital logic circuits. These methods attempt to increase student learning, satisfaction, and retention. Although there are readily accessible and accepted means for measuring satisfaction and retention, there are no widely…

Descriptors: Grounded Theory, Delphi Technique, Concept Formation, Misconceptions

Administering Defining Issues Test Online: Do Response Modes Matter?

Peer reviewed

Direct link

Xu, Yuejin; Iran-Nejad, Asghar; Thoma, Stephen J. – Journal of Interactive Online Learning, 2007

The purpose of the study was to determine comparability of an online version to the original paper-pencil version of Defining Issues Test 2 (DIT2). This study employed methods from both Classical Test Theory (CTT) and Item Response Theory (IRT). Findings from CTT analyses supported the reliability and discriminant validity of both versions.…

Descriptors: Computer Assisted Testing, Test Format, Comparative Analysis, Test Theory

Theoretical and Empirical Comparisons of Holistic and Analytic Scoring of Written and Spoken Discourse.

Download full text

Goulden, Nancy Rost – 1989

Since speech communication evaluators are beginning to adapt the analytic and holistic instruments and methods used for rating written products to oral products and performance, this research review investigated: (1) what the labels "analytic" and "holistic" mean; (2) the theoretical bases of the two scoring approaches; and (3)…

Descriptors: Comparative Analysis, Higher Education, Holistic Evaluation, Rating Scales

Comparison of Traditional and Latent Trait Procedures in Analysis and Selection of Rating Scale Items.

Gamache, LeAnn M. – 1983

Scales constructed under procedures and criteria outlined by the various traditional and latent trait methods were examined as to whether they varied in characteristics related to scale quality. Scales were constructed from a common pool of items analyzed in full form according to Likert and a one-parameter Rasch model for non-dichotomous data.…

Descriptors: Comparative Analysis, Correlation, Higher Education, Item Analysis

Another Look at Correlations between the Oral Proficiency Interview and the Zertifikat Deutsch als Fremdsprache.

Vazulik, Johannes; Brown, Cheri – 1989

A study supplementing earlier research by Lalande and Schweckendiek investigated comparisons and correlations obtained from testing a group of 17 university students of German using both the American Council on the Teaching of Foreign Languages (ACTFL) Oral Proficiency Interview (OPI) and the most recent revision of the examination for the…

Descriptors: Comparative Analysis, Comparative Testing, Correlation, German

Ratings Vs. Equity in the Evaluation of Writing.

McDaniel, Barbara A. – 1985

A study was conducted to determine whether evaluators of large scale essay tests respond the same way toward essays written by English as a second language (ESL) and non-ESL students. The data examined came from the English Placement Test (EPT) administered in the province of British Columbia, Canada, in March 1979. The test was used to identify…

Descriptors: Chinese, Comparative Analysis, English (Second Language), Higher Education

Previous Page | Next Page »

Pages: 1 | 2

Haladyna, Tom	2
Alqarni, Abdulelah Mohammed	1
Baird, Jo-Anne	1
Beaujean, A. Alexander	1
Blair, Bernadette	1
Brown, Cheri	1
Gamache, LeAnn M.	1
Goulden, Nancy Rost	1
Hell, Benedikt	1
Herman, Geoffrey Lindsay	1
Iran-Nejad, Asghar	1
Ishimoto, Michi	1
Ishitani, Lucila	1
Longabach, Tanya	1
McDaniel, Barbara A.	1
Mitchell, Alison M.	1
Orr, Susan	1
Passler, Katja	1
Petscher, Yaacov	1
Peyton, Vicki	1
Roid, Gale	1
Shrock, Sharon	1
Sokoloff, David R.	1
Sussman, Joshua	1
More ▼