Showing 1 to 15 of 19 results
Benton, Tom – Research Matters, 2021
Computer adaptive testing is intended to make assessment more reliable by tailoring the difficulty of the questions a student has to answer to the student's level of ability. Most commonly, this benefit is used to justify shortening tests whilst retaining the reliability of a longer, non-adaptive test. Improvements due to adaptive…
Descriptors: Risk, Item Response Theory, Computer Assisted Testing, Difficulty Level
Peer reviewed
Direct link
Chai, Jun Ho; Lo, Chang Huan; Mayor, Julien – Journal of Speech, Language, and Hearing Research, 2020
Purpose: This study introduces a framework to produce very short versions of the MacArthur-Bates Communicative Development Inventories (CDIs) by combining the Bayesian-inspired approach introduced by Mayor and Mani (2019) with an item response theory-based computerized adaptive testing that adapts to the ability of each child, in line with…
Descriptors: Bayesian Statistics, Item Response Theory, Measures (Individuals), Language Skills
Peer reviewed
Direct link
Smith, William Zachary; Dickenson, Tammiee S.; Rogers, Bradley David – AERA Online Paper Repository, 2017
Questionnaire refinement and a process for selecting items for elimination are important tools for survey developers. One of the major obstacles in questionnaire refinement and item elimination lies in one's ability to adequately and appropriately reconstruct a survey. Often, surveys can be long and strenuous for the respondent,…
Descriptors: Surveys, Psychometrics, Test Construction, Test Reliability
Peer reviewed
Direct link
Zaporozhets, Olga; Fox, Christine M.; Beltyukova, Svetlana A.; Laux, John M.; Piazza, Nick J.; Salyers, Kathleen – Measurement and Evaluation in Counseling and Development, 2015
The purpose of this study was to develop a linear measure of change using University of Rhode Island Change Assessment items that represented Prochaska and DiClemente's theory. The resulting Toledo Measure of Change is short, easy to use, and provides reliable scores for identifying individuals' stage of change and their progression within that stage.
Descriptors: Item Response Theory, Change, Measures (Individuals), Test Construction
Peer reviewed
Direct link
Lee, Yi-Hsuan; Zhang, Jinming – International Journal of Testing, 2017
Simulations were conducted to examine the effect of differential item functioning (DIF) on measurement consequences such as total scores, item response theory (IRT) ability estimates, and test reliability in terms of the ratio of true-score variance to observed-score variance and the standard error of estimation for the IRT ability parameter. The…
Descriptors: Test Bias, Test Reliability, Performance, Scores
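The abstract above defines test reliability as the ratio of true-score variance to observed-score variance. A minimal sketch of that definition, using invented simulated scores (the numbers below are illustrative, not from the cited study):

```python
import random

random.seed(0)

# Hypothetical simulation: observed score = true score + measurement error.
n = 1000
true_scores = [random.gauss(50, 10) for _ in range(n)]   # latent true scores
errors = [random.gauss(0, 5) for _ in range(n)]          # measurement error
observed = [t + e for t, e in zip(true_scores, errors)]

def variance(xs):
    """Population variance of a list of numbers."""
    m = sum(xs) / len(xs)
    return sum((x - m) ** 2 for x in xs) / len(xs)

# Reliability as the ratio of true-score variance to observed-score variance.
reliability = variance(true_scores) / variance(observed)
print(round(reliability, 2))  # close to 10**2 / (10**2 + 5**2) = 0.80 in expectation
```

With these standard deviations the expected value is 100 / 125 = 0.80; any single simulated run will land near that figure.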
James, Syretta R.; Liu, Shihching Jessica; Maina, Nyambura; Wade, Julie; Wang, Helen; Wilson, Heather; Wolanin, Natalie – Montgomery County Public Schools, 2021
The impact of the COVID-19 pandemic continues to overwhelm the functioning and outcomes of educational systems throughout the nation. The public education system is under particular scrutiny given that students, families, and educators are under considerable stress to maintain academic progress. Since the beginning of the crisis, school-systems…
Descriptors: Achievement Tests, COVID-19, Pandemics, Public Schools
Peer reviewed
PDF on ERIC
Sengul Avsar, Asiye; Tavsancil, Ezel – Educational Sciences: Theory and Practice, 2017
This study analysed polytomous items' psychometric properties according to nonparametric item response theory (NIRT) models. Thus, simulated datasets--three different test lengths (10, 20 and 30 items), three sample distributions (normal, right skewed and left skewed) and three sample sizes (100, 250 and 500)--were generated by conducting 20…
Descriptors: Test Items, Psychometrics, Nonparametric Statistics, Item Response Theory
Peer reviewed
Direct link
Anthony, Christopher James; DiPerna, James Clyde – School Psychology Quarterly, 2017
The Academic Competence Evaluation Scales-Teacher Form (ACES-TF; DiPerna & Elliott, 2000) was developed to measure student academic skills and enablers (interpersonal skills, engagement, motivation, and study skills). Although ACES-TF scores have demonstrated psychometric adequacy, the length of the measure may be prohibitive for certain…
Descriptors: Test Items, Efficiency, Item Response Theory, Test Length
Peer reviewed
Direct link
Yao, Lihua – Applied Psychological Measurement, 2013
Through simulated data, five multidimensional computerized adaptive testing (MCAT) selection procedures with varying test lengths are examined and compared using different stopping rules. Fixed item exposure rates are used for all the items, and the Priority Index (PI) method is used for the content constraints. Two stopping rules, standard error…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Selection
Peer reviewed
Direct link
Kahraman, Nilufer; Thompson, Tony – Journal of Educational Measurement, 2011
A practical concern for many existing tests is that subscore test lengths are too short to provide reliable and meaningful measurement. A possible method of improving the subscale reliability and validity would be to make use of collateral information provided by items from other subscales of the same test. To this end, the purpose of this article…
Descriptors: Test Length, Test Items, Alignment (Education), Models
Peer reviewed
Direct link
Yao, Lihua – Psychometrika, 2012
Multidimensional computer adaptive testing (MCAT) can provide higher precision and reliability or reduce test length when compared with unidimensional CAT or with the paper-and-pencil test. This study compared five item selection procedures in the MCAT framework for both domain scores and overall scores through simulation by varying the structure…
Descriptors: Item Banks, Test Length, Simulation, Adaptive Testing
Peer reviewed
PDF on ERIC
Lee, Yi-Hsuan; Zhang, Jinming – ETS Research Report Series, 2010
This report examines the consequences of differential item functioning (DIF) using simulated data. Its impact on total score, item response theory (IRT) ability estimate, and test reliability was evaluated in various testing scenarios created by manipulating the following four factors: test length, percentage of DIF items per form, sample sizes of…
Descriptors: Test Bias, Item Response Theory, Test Items, Scores
Peer reviewed
PDF on ERIC
Bulut, Okan; Kan, Adnan – Eurasian Journal of Educational Research, 2012
Problem Statement: Computerized adaptive testing (CAT) is a sophisticated and efficient way of delivering examinations. In CAT, items for each examinee are selected from an item bank based on the examinee's responses to the items. In this way, the difficulty level of the test is adjusted based on the examinee's ability level. Instead of…
Descriptors: Adaptive Testing, Computer Assisted Testing, College Entrance Examinations, Graduate Students
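The CAT mechanism this abstract describes (selecting each examinee's next item from a bank according to the current ability estimate) is commonly implemented by choosing the unadministered item with maximum Fisher information. A minimal sketch under a 2PL IRT model; the item bank and parameter values are invented for illustration and are not from the cited study:

```python
import math

def prob_2pl(theta, a, b):
    """Probability of a correct response under the 2PL IRT model."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def fisher_info(theta, a, b):
    """Fisher information of a 2PL item at ability level theta."""
    p = prob_2pl(theta, a, b)
    return a * a * p * (1.0 - p)

def select_next_item(theta, bank, administered):
    """Pick the unadministered item with maximum information at theta."""
    candidates = [i for i in range(len(bank)) if i not in administered]
    return max(candidates, key=lambda i: fisher_info(theta, *bank[i]))

# Hypothetical item bank: (discrimination a, difficulty b) per item.
bank = [(1.0, -1.5), (1.2, 0.0), (0.8, 0.5), (1.5, 1.0)]
print(select_next_item(0.0, bank, administered={1}))  # → 3
```

After each response, the ability estimate theta is updated and the selection step repeats, which is how the test's difficulty adapts to the examinee.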
Peer reviewed
PDF on ERIC
Rotou, Ourania; Patsula, Liane; Steffen, Manfred; Rizavi, Saba – ETS Research Report Series, 2007
Traditionally, the fixed-length linear paper-and-pencil (P&P) mode of administration has been the standard method of test delivery. With the advancement of technology, however, the popularity of administering tests using adaptive methods like computerized adaptive testing (CAT) and multistage testing (MST) has grown in the field of measurement…
Descriptors: Comparative Analysis, Test Format, Computer Assisted Testing, Models
Peer reviewed
Meijer, Rob R.; And Others – Applied Psychological Measurement, 1994
The power of the nonparametric person-fit statistic, U3, is investigated through simulations as a function of item characteristics, test characteristics, person characteristics, and the group to which examinees belong. Results suggest conditions under which relatively short tests can be used for person-fit analysis. (SLD)
Descriptors: Difficulty Level, Group Membership, Item Response Theory, Nonparametric Statistics