Publication Date
In 2025: 0
Since 2024: 0
Since 2021 (last 5 years): 2
Since 2016 (last 10 years): 3
Since 2006 (last 20 years): 14
Descriptor
Comparative Analysis: 15
Test Theory: 15
Item Response Theory: 7
College Students: 6
Foreign Countries: 5
Test Validity: 5
Correlation: 4
Multiple Choice Tests: 4
Scores: 4
Test Items: 4
College Entrance Examinations: 3
Publication Type
Reports - Research: 11
Journal Articles: 10
Dissertations/Theses -…: 3
Reports - Evaluative: 1
Tests/Questionnaires: 1
Education Level
Higher Education: 15
Postsecondary Education: 15
Early Childhood Education: 1
Elementary Education: 1
Grade 2: 1
High Schools: 1
Primary Education: 1
Secondary Education: 1
Assessments and Surveys
ACT Assessment: 1
Defining Issues Test: 1
English Proficiency Test: 1
National Assessment of…: 1
SAT (College Admission Test): 1
Test of English for…: 1
Polat, Murat; Turhan, Nihan S.; Toraman, Cetin – Pegem Journal of Education and Instruction, 2022
Testing English writing skills could be multi-dimensional; thus, the study aimed to compare students' writing scores calculated according to Classical Test Theory (CTT) and Multi-Facet Rasch Model (MFRM). The research was carried out in 2019 with 100 university students studying at a foreign language preparatory class and four experienced…
Descriptors: Comparative Analysis, Test Theory, Item Response Theory, Student Evaluation
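The entry above contrasts CTT writing scores, which are essentially averages of the ratings an essay received, with Many-Facet Rasch Model (MFRM) measures that also adjust for how severe or lenient each rater is. The sketch below is only a loose illustration of why the two can diverge: the data are invented, the design is deliberately incomplete (not every rater scores every essay), and the simple additive adjustment is a stand-in for the logit-scale joint estimation a real MFRM performs.

```python
import numpy as np

# Hypothetical incomplete rating design: 6 essays x 4 raters on a 0-10 scale
# (invented numbers, not the study's data). NaN = rater did not score that essay.
R = np.array([
    [7.0,    np.nan, 8.0,    np.nan],
    [np.nan, 8.0,    np.nan, 6.0   ],
    [9.0,    np.nan, np.nan, 8.0   ],
    [np.nan, 7.0,    6.0,    np.nan],
    [5.0,    np.nan, 6.0,    np.nan],
    [np.nan, 6.0,    np.nan, 4.0   ],
])

# CTT-style score: the average of whatever ratings each essay happened to receive.
ctt = np.nanmean(R, axis=1)

# Rough facets-style adjustment: alternately estimate essay scores and rater
# leniency (how far above or below expectation each rater scores), then remove
# the rater effect before averaging. A real MFRM estimates these jointly on a
# logit scale; this additive version only shows why the two scores can differ.
ability = ctt.copy()
leniency = np.zeros(R.shape[1])
for _ in range(100):
    leniency = np.nanmean(R - ability[:, None], axis=0)
    ability = np.nanmean(R - leniency[None, :], axis=1)

print("Plain (CTT) averages:    ", np.round(ctt, 2))
print("Rater-adjusted averages: ", np.round(ability, 2))
print("Estimated rater leniency:", np.round(leniency, 2))
```

In a fully crossed design the adjustment washes out of the averages; it is precisely when raters are unevenly assigned, as in most operational writing assessments, that the two sets of scores separate.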
Polat, Murat – International Online Journal of Education and Teaching, 2022
Foreign language testing is a multi-dimensional phenomenon and obtaining objective and error-free scores on learners' language skills is often problematic. While assessing foreign language performance on high-stakes tests, using different testing approaches including Classical Test Theory (CTT), Generalizability Theory (GT) and/or Item Response…
Descriptors: Second Language Learning, Second Language Instruction, Item Response Theory, Language Tests
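This entry mentions Generalizability Theory (GT) alongside CTT and IRT for performance scores. The core GT move is to decompose rating variance into components (persons, raters, residual) and build a generalizability coefficient from them. The sketch below uses an invented fully crossed persons-by-raters design and the standard ANOVA-based estimators; it is a generic illustration, not the study's analysis.

```python
import numpy as np

rng = np.random.default_rng(42)

# Invented fully crossed one-facet design: 30 examinees (p) each rated by 5 raters (r).
n_p, n_r = 30, 5
person = rng.normal(0, 1.0, n_p)            # true person effects
rater = rng.normal(0, 0.4, n_r)             # rater severity/leniency effects
error = rng.normal(0, 0.7, (n_p, n_r))      # residual (p x r interaction + error)
X = 5 + person[:, None] + rater[None, :] + error

grand = X.mean()
mean_p = X.mean(axis=1)
mean_r = X.mean(axis=0)

# Mean squares for the p x r design without replication.
ms_p = n_r * np.sum((mean_p - grand) ** 2) / (n_p - 1)
ms_r = n_p * np.sum((mean_r - grand) ** 2) / (n_r - 1)
ms_res = np.sum((X - mean_p[:, None] - mean_r[None, :] + grand) ** 2) / ((n_p - 1) * (n_r - 1))

# Estimated variance components.
var_res = ms_res
var_p = (ms_p - ms_res) / n_r
var_r = (ms_r - ms_res) / n_p

# Generalizability (relative) coefficient when scores are averaged over n_r raters.
g_coef = var_p / (var_p + var_res / n_r)

print(f"var(person)={var_p:.3f}  var(rater)={var_r:.3f}  var(residual)={var_res:.3f}")
print(f"Generalizability coefficient with {n_r} raters: {g_coef:.3f}")
```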
Ramsay, James O.; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2017
This article promotes the use of modern test theory in testing situations where sum scores for binary responses are now used. It directly compares the efficiencies and biases of classical and modern test analyses and finds an improvement in the root mean squared error of ability estimates of about 5% for two designed multiple-choice tests and…
Descriptors: Scoring, Test Theory, Computation, Maximum Likelihood Statistics
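The Ramsay and Wiberg entry reports roughly a 5% reduction in the root mean squared error of ability estimates when modern test theory scoring replaces simple sum scores. As a hedged illustration of what such a comparison involves (simulated data, not the authors' tests or estimators), the sketch below generates 2PL responses, scores them once with unit item weights (equivalent to the raw sum score) and once with the generating discriminations, and compares RMSE against the true abilities.

```python
import numpy as np

rng = np.random.default_rng(7)

# Simulate a 40-item test under a 2PL model (assumed parameters, purely illustrative).
n_persons, n_items = 2000, 40
theta = rng.normal(0.0, 1.0, n_persons)                  # true abilities
a = rng.uniform(0.5, 2.0, n_items)                       # item discriminations
b = rng.normal(0.0, 1.0, n_items)                        # item difficulties

p = 1.0 / (1.0 + np.exp(-a * (theta[:, None] - b)))      # response probabilities
X = (rng.random((n_persons, n_items)) < p).astype(int)   # 0/1 responses

grid = np.linspace(-4, 4, 321)                           # candidate ability values

def mle_theta(discrim):
    """Grid-search ML ability estimates given difficulties b and the supplied
    discriminations (all ones ~ sum-score scoring; the true a ~ 2PL scoring)."""
    pg = 1.0 / (1.0 + np.exp(-discrim * (grid[:, None] - b)))   # grid x items
    loglik = X @ np.log(pg).T + (1 - X) @ np.log(1 - pg).T      # persons x grid
    return grid[np.argmax(loglik, axis=1)]

theta_sum = mle_theta(np.ones(n_items))   # monotone in the raw sum score
theta_2pl = mle_theta(a)                  # weights items by (known) discrimination

def rmse(est):
    return np.sqrt(np.mean((est - theta) ** 2))

print(f"RMSE, sum-score scoring: {rmse(theta_sum):.3f}")
print(f"RMSE, 2PL ML scoring:    {rmse(theta_2pl):.3f}")
```

In practice the item parameters would themselves be estimated, so the gain observed here is an upper bound on what a real analysis would see.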
Ishimoto, Michi; Thornton, Ronald K.; Sokoloff, David R. – Physical Review Special Topics - Physics Education Research, 2014
This study assesses the Japanese translation of the Force and Motion Conceptual Evaluation (FMCE). Researchers are often interested in comparing the conceptual ideas of students with different cultural backgrounds. The FMCE has been useful in identifying the concepts of English-speaking students from different backgrounds. To identify effectively…
Descriptors: Test Validity, Physics, Motion, Scientific Concepts
Woodruff, David; Traynor, Anne; Cui, Zhongmin; Fang, Yu – ACT, Inc., 2013
Professional standards for educational testing recommend that both the overall standard error of measurement and the conditional standard error of measurement (CSEM) be computed on the score scale used to report scores to examinees. Several methods have been developed to compute scale score CSEMs. This paper compares three methods, based on…
Descriptors: Comparative Analysis, Error of Measurement, Scores, Scaling
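The Woodruff et al. entry concerns conditional standard errors of measurement (CSEMs) reported on the scale actually used for scores. One classical starting point, not necessarily one of the three methods the paper compares, is Lord's binomial-error CSEM for a number-correct score, sqrt(x(n - x)/(n - 1)), carried through the raw-to-scale conversion. The sketch below shows that idea with an invented 40-item test and an invented linear scaling.

```python
import math

n_items = 40                 # test length (assumed)
A, B = 2.5, 100.0            # assumed linear raw-to-scale conversion: scale = A*raw + B

def csem_raw(x, n=n_items):
    """Lord's binomial-error CSEM for a number-correct score x on an n-item test."""
    return math.sqrt(x * (n - x) / (n - 1))

for raw in (5, 10, 20, 30, 35):
    scale = A * raw + B
    # For a linear conversion the CSEM simply rescales by |A|; nonlinear
    # conversions, common for operational score scales, need the extra
    # machinery that the compared methods provide.
    print(f"raw={raw:2d}  scale={scale:6.1f}  "
          f"CSEM_raw={csem_raw(raw):.2f}  CSEM_scale={abs(A) * csem_raw(raw):.2f}")
```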
Williamson, Kathryn Elizabeth – ProQuest LLC, 2013
The topic of Newtonian gravity offers a unique vantage point from which to investigate and encourage conceptual change because it is something with which everyone has daily experience, and because it is taught in two courses that reach a wide variety of students--introductory-level college astronomy ("Astro 101") and physics ("Phys…
Descriptors: Scientific Concepts, Science Tests, College Science, Astronomy
Sussman, Joshua; Beaujean, A. Alexander; Worrell, Frank C.; Watson, Stevie – Measurement and Evaluation in Counseling and Development, 2013
Item response models (IRMs) were used to analyze Cross Racial Identity Scale (CRIS) scores. Rasch analysis scores were compared with classical test theory (CTT) scores. The partial credit model demonstrated a high goodness of fit and correlations between Rasch and CTT scores ranged from 0.91 to 0.99. CRIS scores are supported by both methods.…
Descriptors: Item Response Theory, Test Theory, Measures (Individuals), Racial Identification
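The correlations of 0.91 to 0.99 between Rasch and CTT scores reported in this entry are what one would expect when the Rasch person measure is a monotone but nonlinear (roughly logistic) transform of the sum score. The sketch below makes that point with invented Likert data (not the CRIS items): the "Rasch-like" measure is a simple logit of the proportion of the maximum score, standing in for a measure that a real partial credit analysis would estimate jointly with item and threshold parameters.

```python
import numpy as np

rng = np.random.default_rng(3)

# Invented data: 500 respondents, 10 Likert items scored 1-5.
n_resp, n_items = 500, 10
trait = rng.normal(0.0, 1.0, n_resp)
noise = rng.normal(0.0, 1.0, (n_resp, n_items))
responses = np.clip(np.round(3 + 1.2 * trait[:, None] + noise), 1, 5)

# CTT score: the plain sum across items.
ctt = responses.sum(axis=1)

# Stand-in for a Rasch/partial-credit person measure: a logit transform of the
# proportion of the maximum possible score (sum scores range from 10 to 50).
prop = (ctt - n_items) / (n_items * 4)
prop = prop.clip(0.01, 0.99)            # keep the logit finite at the extremes
rasch_like = np.log(prop / (1 - prop))

r = np.corrcoef(ctt, rasch_like)[0, 1]
print(f"Pearson r between sum score and logit-transformed score: {r:.3f}")
```

The correlation is high because the two scores order respondents almost identically; it falls short of 1.0 mainly because the logit stretches the scale at the extremes.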
Yorke, Mantz; Orr, Susan; Blair, Bernadette – Studies in Higher Education, 2014
There has long been the suspicion amongst staff in Art & Design that the ratings given to their subject disciplines in the UK's National Student Survey are adversely affected by a combination of circumstances--a "perfect storm". The "perfect storm" proposition is tested by comparing ratings for Art & Design with those…
Descriptors: Student Surveys, National Surveys, Art Education, Design
Almehrizi, Rashid S. – Applied Psychological Measurement, 2013
The majority of large-scale assessments develop various score scales that are either linear or nonlinear transformations of raw scores for better interpretations and uses of assessment results. The current formula for coefficient alpha (α; the commonly used reliability coefficient) only provides internal consistency reliability estimates of raw…
Descriptors: Raw Scores, Scaling, Reliability, Computation
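The Almehrizi entry notes that the usual coefficient alpha formula applies to raw (summed) scores rather than to transformed scale scores. For reference, raw-score alpha is k/(k - 1) times (1 minus the ratio of summed item variances to the variance of the total score); the sketch below computes it on invented dichotomous data. Extending the coefficient to nonlinearly transformed scale scores is the problem the article addresses and is not shown here.

```python
import numpy as np

def coefficient_alpha(items: np.ndarray) -> float:
    """Cronbach's coefficient alpha for an examinees-by-items score matrix.

    alpha = k/(k-1) * (1 - sum of item variances / variance of the total score),
    an internal-consistency reliability estimate for the raw sum score.
    """
    k = items.shape[1]
    item_vars = items.var(axis=0, ddof=1)
    total_var = items.sum(axis=1).var(ddof=1)
    return (k / (k - 1)) * (1.0 - item_vars.sum() / total_var)

# Invented 0/1 responses: 200 examinees x 12 items driven by a common ability.
rng = np.random.default_rng(11)
ability = rng.normal(0, 1, 200)
p = 1 / (1 + np.exp(-(ability[:, None] - rng.normal(0, 1, 12))))
X = (rng.random((200, 12)) < p).astype(int)

print(f"alpha (raw sum score) = {coefficient_alpha(X):.3f}")
```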
Sharkness, Jessica; DeAngelo, Linda – Research in Higher Education, 2011
This study compares the psychometric utility of Classical Test Theory (CTT) and Item Response Theory (IRT) for scale construction with data from higher education student surveys. Using 2008 Your First College Year (YFCY) survey data from the Cooperative Institutional Research Program at the Higher Education Research Institute at UCLA, two scales…
Descriptors: Student Surveys, Measures (Individuals), Psychometrics, Item Response Theory
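The Sharkness and DeAngelo entry compares CTT and IRT as tools for building survey scales. The CTT half of that comparison typically rests on item analysis: corrected item-total correlations and alpha-if-item-deleted statistics that flag weak items before a unit-weighted scale score is formed. The sketch below shows only that CTT side on invented Likert data (not the YFCY items); the IRT side, such as graded response model parameters and item information, is not shown.

```python
import numpy as np

rng = np.random.default_rng(8)

# Invented Likert responses (1-5) to a 6-item scale for 400 respondents;
# the last item is made noisier so the CTT screening flags it.
n_resp, n_items = 400, 6
trait = rng.normal(0, 1, n_resp)
loadings = np.array([1.0, 1.0, 0.9, 0.8, 0.9, 0.2])   # last item barely measures the trait
noise = rng.normal(0, 1, (n_resp, n_items))
X = np.clip(np.round(3 + loadings * trait[:, None] + noise), 1, 5)

def alpha(mat):
    k = mat.shape[1]
    return k / (k - 1) * (1 - mat.var(axis=0, ddof=1).sum() / mat.sum(axis=1).var(ddof=1))

total = X.sum(axis=1)
for j in range(n_items):
    rest = total - X[:, j]                               # total with the item removed
    r_it = np.corrcoef(X[:, j], rest)[0, 1]              # corrected item-total correlation
    a_del = alpha(np.delete(X, j, axis=1))               # alpha if this item were dropped
    print(f"item {j + 1}: corrected item-total r = {r_it:.2f}, alpha if deleted = {a_del:.3f}")

print(f"alpha for the full 6-item scale: {alpha(X):.3f}")
```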
Pan, Yi-Ching – TESL-EJ, 2014
In much of the world, the issue of accountability and measurement of educational outcomes is highly controversial. Exit testing is part of the movement to ascertain what students have learned and hold institutions and teachers to account. However, compared to the large number of teacher washback studies, learner washback research is lacking…
Descriptors: Standardized Tests, Exit Examinations, Questionnaires, College Students
Mozie-Ross, Yvette D. – ProQuest LLC, 2011
This exploratory study contributes to what is known about the college choice process by providing a quantitative comparative analysis to determine how high school graduates who identify teachers as influential in their choice of college differ from graduates who do not. Specifically, this study answers the following research question: How do…
Descriptors: College Choice, Grade Point Average, Statistical Analysis, Comparative Analysis
Herman, Geoffrey Lindsay – ProQuest LLC, 2011
Instructors in electrical and computer engineering and in computer science have developed innovative methods to teach digital logic circuits. These methods attempt to increase student learning, satisfaction, and retention. Although there are readily accessible and accepted means for measuring satisfaction and retention, there are no widely…
Descriptors: Grounded Theory, Delphi Technique, Concept Formation, Misconceptions
Xu, Yuejin; Iran-Nejad, Asghar; Thoma, Stephen J. – Journal of Interactive Online Learning, 2007
The purpose of the study was to determine comparability of an online version to the original paper-pencil version of Defining Issues Test 2 (DIT2). This study employed methods from both Classical Test Theory (CTT) and Item Response Theory (IRT). Findings from CTT analyses supported the reliability and discriminant validity of both versions.…
Descriptors: Computer Assisted Testing, Test Format, Comparative Analysis, Test Theory
Cascallar, Alicia S.; Dorans, Neil J. – College Entrance Examination Board, 2003
Score linkages between the Verbal and Math sections of the SAT® I: Reasoning Test and the corresponding sections of the new version of a Spanish-language admissions test, the Prueba de Aptitud Academica® (PAA™) were investigated. A bilingual group design was employed. A language proficiency measure (ESLAT) was used to define the bilingual group…
Descriptors: Equated Scores, Spanish, English, Language of Instruction
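The Cascallar and Dorans entry links SAT I section scores to the Spanish-language PAA through a single bilingual group. One common single-group linking technique, used here only as a generic illustration and not as the study's actual procedure, is equipercentile linking: a score on test X is mapped to the score on test Y that has the same percentile rank in the linking group. A minimal sketch with simulated scores:

```python
import numpy as np

rng = np.random.default_rng(5)

# Simulated scores for one bilingual linking group taking both tests
# (purely illustrative; not SAT or PAA data or score scales).
n = 1500
ability = rng.normal(0, 1, n)
score_x = np.clip(np.round(500 + 100 * ability + rng.normal(0, 40, n)), 200, 800)
score_y = np.clip(np.round(95 + 20 * ability + rng.normal(0, 10, n)), 20, 160)

def equipercentile_link(x_values, x_scores, y_scores):
    """Map each value in x_values to the y score with the same percentile rank."""
    x_sorted = np.sort(x_scores)
    y_sorted = np.sort(y_scores)
    # Percentile rank of each x value within the linking group (0..100).
    pr = np.searchsorted(x_sorted, x_values, side="right") / len(x_sorted) * 100
    return np.percentile(y_sorted, pr)

for x in (400, 500, 600, 700):
    y = equipercentile_link(np.array([x]), score_x, score_y)[0]
    print(f"X score {x} -> linked Y score {y:.1f}")
```

Operational linkings add smoothing of the score distributions and standard errors for the linking function; those refinements are omitted here.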