ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	8

Descriptor

Error of Measurement	12
Scores	12
Test Format	12
Comparative Analysis	4
Item Response Theory	4
Psychometrics	4
Test Items	4
Computer Assisted Testing	3
Correlation	3
Language Proficiency	3
Language Tests	3
Reliability	3
Simulation	3
Test Construction	3
Test Length	3
Test Reliability	3
Advanced Placement	2
College Entrance Examinations	2
English (Second Language)	2
Equated Scores	2
Foreign Countries	2
Models	2
Multiple Choice Tests	2
Reading Comprehension	2
Reading Tests	2
More ▼

Source

ETS Research Report Series	2
ProQuest LLC	2
Education and Information…	1
Educational Measurement:…	1
International Journal of…	1
International Journal of…	1

Publication Type

Reports - Research	8
Journal Articles	6
Dissertations/Theses -…	2
Speeches/Meeting Papers	2
Information Analyses	1
Reports - Descriptive	1

Education Level

Higher Education	2
Postsecondary Education	2
High Schools	1
Secondary Education	1

Audience

Location

Greece	1
Iran	1
Ireland (Dublin)	1

Laws, Policies, & Programs

Assessments and Surveys

College Level Academic Skills…	1
Test of English as a Foreign…	1

What Works Clearinghouse Rating

Showing all 12 results Save | Export

Measuring Language Ability of Students with Compensatory Multidimensional CAT: A Post-Hoc Simulation Study

Peer reviewed

Direct link

Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022

The computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. The multidimensional CAT (MCAT) designs differ in terms of different item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…

Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency

Animated Videos in Assessment: Comparing Validity Evidence from and Test-Takers' Reactions to an Animated and a Text-Based Situational Judgment Test

Peer reviewed

Direct link

Karakolidis, Anastasios; O'Leary, Michael; Scully, Darina – International Journal of Testing, 2021

The linguistic complexity of many text-based tests can be a source of construct-irrelevant variance, as test-takers' performance may be affected by factors that are beyond the focus of the assessment itself, such as reading comprehension skills. This experimental study examined the extent to which the use of animated videos, as opposed to written…

Descriptors: Animation, Vignettes, Video Technology, Test Format

On the Dimensionality of Reading Comprehension Tests Composed of Text Comprehension Items and Cloze Test Items

Peer reviewed
PDF on ERIC

Download full text

Sheybani, Elias; Zeraatpishe, Mitra – International Journal of Language Testing, 2018

Test method is deemed to affect test scores along with examinee ability (Bachman, 1996). In this research the role of method facet in reading comprehension tests is studied. Bachman divided method facet into five categories, one category is the nature of input and the nature of expected response. This study examined the role of method effect in…

Descriptors: Reading Comprehension, Reading Tests, Test Items, Test Format

ETS Psychometric Contributions: Focus on Test Scores. Research Report. ETS RR-13-15. ETS R&D Scientific and Policy Contributions Series. ETS SPC-13-03

Peer reviewed
PDF on ERIC

Download full text

Moses, Tim – ETS Research Report Series, 2013

The purpose of this report is to review ETS psychometric contributions that focus on test scores. Two major sections review contributions based on assessing test scores' measurement characteristics and other contributions about using test scores as predictors in correlational and regression relationships. An additional section reviews additional…

Descriptors: Psychometrics, Scores, Correlation, Regression (Statistics)

Assessing First- and Second-Order Equity for the Common-Item Nonequivalent Groups Design Using Multidimensional IRT

Direct link

Andrews, Benjamin James – ProQuest LLC, 2011

The equity properties can be used to assess the quality of an equating. The degree to which expected scores conditional on ability are similar between test forms is referred to as first-order equity. Second-order equity is the degree to which conditional standard errors of measurement are similar between test forms after equating. The purpose of…

Descriptors: Test Format, Advanced Placement, Simulation, True Scores

Psychometric Properties of Raw and Scale Scores on Mixed-Format Tests

Peer reviewed

Direct link

Kolen, Michael J.; Lee, Won-Chan – Educational Measurement: Issues and Practice, 2011

This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…

Descriptors: Test Use, Test Format, Error of Measurement, Raw Scores

Data Collection Design for Equivalent Groups Equating: Using a Matrix Stratification Framework for Mixed-Format Assessment

Direct link

Mbella, Kinge Keka – ProQuest LLC, 2012

Mixed-format assessments are increasingly being used in large scale standardized assessments to measure a continuum of skills ranging from basic recall to higher order thinking skills. These assessments are usually comprised of a combination of (a) multiple-choice items which can be efficiently scored, have stable psychometric properties, and…

Descriptors: Educational Assessment, Test Format, Evaluation Methods, Multiple Choice Tests

Comparison of Multistage Tests with Computerized Adaptive and Paper-and-Pencil Tests. Research Report. ETS RR-07-04

Peer reviewed
PDF on ERIC

Download full text

Rotou, Ourania; Patsula, Liane; Steffen, Manfred; Rizavi, Saba – ETS Research Report Series, 2007

Traditionally, the fixed-length linear paper-and-pencil (P&P) mode of administration has been the standard method of test delivery. With the advancement of technology, however, the popularity of administering tests using adaptive methods like computerized adaptive testing (CAT) and multistage testing (MST) has grown in the field of measurement…

Descriptors: Comparative Analysis, Test Format, Computer Assisted Testing, Models

How Out-of-Level Testing Affects the Psychometric Quality of Test Scores. Out-of-Level Testing Report 2.

Download full text

Bielinski, John; Thurlow, Martha; Minnema, Jane; Scott, Jim – 2000

This report is a review and analysis of the psychometric literature on the topic of out-of-level testing. Out-of-level testing refers to the practice of using a level of the test other than the test taken by most of the students in a student's current grade level. Much of the research on out-of-level testing was conducted in the 1970s and 1980s,…

Descriptors: Achievement Tests, Elementary Secondary Education, Equated Scores, Error of Measurement

Determining the Representation of Constructed Response Items in Mixed-Item Format Exams.

Download full text

Sykes, Robert C.; Truskosky, Denise; White, Hillory – 2001

The purpose of this research was to study the effect of the three different ways of increasing the number of points contributed by constructed response (CR) items on the reliability of test scores from mixed-item-format tests. The assumption of unidimensionality that underlies the accuracy of item response theory model-based standard error…

Descriptors: Constructed Response, Elementary Education, Elementary School Students, Error of Measurement

Test-Retest Analyses of the Test of English as a Foreign Language. TOEFL Research Reports Report 45.

Download full text

Henning, Grant – 1993

This study provides information about the total and component scores of the Test of English as a Foreign Language (TOEFL). First, the study provides comparative global and component estimates of test-retest, alternate-form, and internal-consistency reliability, controlling for sources of measurement error inherent in the examinees and the testing…

Descriptors: Difficulty Level, English (Second Language), Error of Measurement, Estimation (Mathematics)

Investigating Differences in Mean Score on Adaptive and Paper and Pencil Versions of the College Level Academic Skills Reading Test.

Download full text

Legg, Sue M.; Buhr, Dianne C. – 1990

Possible causes of a 16-point mean score increase for the computer adaptive form of the College Level Academic Skills Test (CLAST) in reading over the paper-and-pencil test (PPT) in reading are examined. The adaptive form of the CLAST was used in a state-wide field test in which reading, writing, and computation scores for approximately 1,000…

Descriptors: Adaptive Testing, College Entrance Examinations, Community Colleges, Comparative Testing

Andrews, Benjamin James	1
Bielinski, John	1
Buhr, Dianne C.	1
Gelbal, Selahattin	1
Henning, Grant	1
Karakolidis, Anastasios	1
Kolen, Michael J.	1
Lee, Won-Chan	1
Legg, Sue M.	1
Mbella, Kinge Keka	1
Minnema, Jane	1
Moses, Tim	1
O'Leary, Michael	1
Ozdemir, Burhanettin	1
Patsula, Liane	1
Rizavi, Saba	1
Rotou, Ourania	1
Scott, Jim	1
Scully, Darina	1
Sheybani, Elias	1
Steffen, Manfred	1
Sykes, Robert C.	1
Thurlow, Martha	1
Truskosky, Denise	1
White, Hillory	1
More ▼