ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	4
Since 2006 (last 20 years)	13

Descriptor

Psychometrics	19
Simulation	19
Test Construction	19
Test Items	11
Computer Assisted Testing	8
Item Response Theory	8
Adaptive Testing	7
Difficulty Level	5
Measurement Techniques	5
Comparative Analysis	4
Scoring	4
Test Reliability	4
Educational Testing	3
Evaluation Research	3
Item Banks	3
Test Format	3
Testing Problems	3
Ability	2
Data Analysis	2
Educational Assessment	2
Equated Scores	2
Evaluation Methods	2
Foreign Countries	2
Identification	2
Mathematics	2
More ▼

Source

Applied Measurement in…	2
ETS Research Report Series	2
Journal of Educational…	2
ProQuest LLC	2
Educational Researcher	1
IAP - Information Age…	1
Journal of Applied Testing…	1
Psychometrika	1
Sociological Methods &…	1
Studies in Educational…	1

Publication Type

Journal Articles	11
Reports - Research	9
Reports - Evaluative	6
Dissertations/Theses -…	2
Books	1
Collected Works - General	1
Reports - Descriptive	1
Speeches/Meeting Papers	1

Education Level

Postsecondary Education	2
Adult Education	1
Elementary Secondary Education	1

Audience

Location

Denmark	1
New Zealand	1

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing 1 to 15 of 19 results Save | Export

Using Existing Data to Inform Development of New Item Types. Research Report. ETS RR-20-01

Peer reviewed
PDF on ERIC

Download full text

Guo, Hongwen; Ling, Guangming; Frankel, Lois – ETS Research Report Series, 2020

With advances in technology, researchers and test developers are developing new item types to measure complex skills like problem solving and critical thinking. Analyzing such items is often challenging because of their complicated response patterns, and thus it is important to develop psychometric methods for practitioners and researchers to…

Descriptors: Test Construction, Test Items, Item Analysis, Psychometrics

A Simulation-Based Method for Finding the Optimal Number of Options for Multiple-Choice Items on a Test. Research Report. ETS RR-18-22

Peer reviewed
PDF on ERIC

Download full text

Guo, Hongwen; Zu, Jiyun; Kyllonen, Patrick – ETS Research Report Series, 2018

For a multiple-choice test under development or redesign, it is important to choose the optimal number of options per item so that the test possesses the desired psychometric properties. On the basis of available data for a multiple-choice assessment with 8 options, we evaluated the effects of changing the number of options on test properties…

Descriptors: Multiple Choice Tests, Test Items, Simulation, Test Construction

An SEM Algorithm for Scale Reduction Incorporating Evaluation of Multiple Psychometric Criteria

Peer reviewed

Direct link

Browne, Matthew; Rockloff, Matthew; Rawat, Vijay – Sociological Methods & Research, 2018

Development and refinement of self-report measures generally involves selecting a subset of indicators from a larger set. Despite the importance of this task, methods applied to accomplish this are often idiosyncratic and ad hoc, or based on incomplete statistical criteria. We describe a structural equation modeling (SEM)-based technique, based on…

Descriptors: Structural Equation Models, Scaling, Evaluation Criteria, Psychometrics

Does Maximizing Information at the Cut Score Always Maximize Classification Accuracy and Consistency?

Peer reviewed

Direct link

Wyse, Adam E.; Babcock, Ben – Journal of Educational Measurement, 2016

A common suggestion made in the psychometric literature for fixed-length classification tests is that one should design tests so that they have maximum information at the cut score. Designing tests in this way is believed to maximize the classification accuracy and consistency of the assessment. This article uses simulated examples to illustrate…

Descriptors: Cutting Scores, Psychometrics, Test Construction, Classification

The Effect of Anchor Test Construction on Scale Drift

Peer reviewed

Direct link

Antal, Judit; Proctor, Thomas P.; Melican, Gerald J. – Applied Measurement in Education, 2014

In common-item equating the anchor block is generally built to represent a miniature form of the total test in terms of content and statistical specifications. The statistical properties frequently reflect equal mean and spread of item difficulty. Sinharay and Holland (2007) suggested that the requirement for equal spread of difficulty may be too…

Descriptors: Test Items, Equated Scores, Difficulty Level, Item Response Theory

A Procedure for Dimensionality Analyses of Response Data from Various Test Designs

Peer reviewed

Direct link

Zhang, Jinming – Psychometrika, 2013

In some popular test designs (including computerized adaptive testing and multistage testing), many item pairs are not administered to any test takers, which may result in some complications during dimensionality analyses. In this paper, a modified DETECT index is proposed in order to perform dimensionality analyses for response data from such…

Descriptors: Adaptive Testing, Simulation, Computer Assisted Testing, Test Reliability

A Comparison of Equating/Linking Using the Stocking-Lord Method and Concurrent Calibration with Mixed-Format Tests in the Non-Equivalent Groups Common-Item Design under IRT

Direct link

Tian, Feng – ProQuest LLC, 2011

There has been a steady increase in the use of mixed-format tests, that is, tests consisting of both multiple-choice items and constructed-response items in both classroom and large-scale assessments. This calls for appropriate equating methods for such tests. As Item Response Theory (IRT) has rapidly become mainstream as the theoretical basis for…

Descriptors: Item Response Theory, Comparative Analysis, Equated Scores, Statistical Analysis

Random or Fixed Testlet Effects: A Comparison of Two Multilevel Testlet Models

Direct link

Chen, Tzu-An – ProQuest LLC, 2010

This simulation study compared the performance of two multilevel measurement testlet (MMMT) models: Beretvas and Walker's (2008) two-level MMMT model and Jiao, Wang, and Kamata's (2005) three-level model. Several conditions were manipulated (including testlet length, sample size, and the pattern of the testlet effects) to assess the impact on the…

Descriptors: Simulation, Item Response Theory, Comparative Analysis, Models

An Automatic Online Calibration Design in Adaptive Testing

Peer reviewed

Direct link

Makransky, Guido; Glas, Cees A. W. – Journal of Applied Testing Technology, 2010

An accurately calibrated item bank is essential for a valid computerized adaptive test. However, in some settings, such as occupational testing, there is limited access to test takers for calibration. As a result of the limited access to possible test takers, collecting data to accurately calibrate an item bank in an occupational setting is…

Descriptors: Foreign Countries, Simulation, Adaptive Testing, Computer Assisted Testing

Automated Test Assembly for Cognitive Diagnosis Models Using a Genetic Algorithm

Peer reviewed

Direct link

Finkelman, Matthew; Kim, Wonsuk; Roussos, Louis A. – Journal of Educational Measurement, 2009

Much recent psychometric literature has focused on cognitive diagnosis models (CDMs), a promising class of instruments used to measure the strengths and weaknesses of examinees. This article introduces a genetic algorithm to perform automated test assembly alongside CDMs. The algorithm is flexible in that it can be applied whether the goal is to…

Descriptors: Identification, Genetics, Test Construction, Mathematics

Item Position and Item Difficulty Change in an IRT-Based Common Item Equating Design

Peer reviewed

Direct link

Meyers, Jason L.; Miller, G. Edward; Way, Walter D. – Applied Measurement in Education, 2009

In operational testing programs using item response theory (IRT), item parameter invariance is threatened when an item appears in a different location on the live test than it did when it was field tested. This study utilizes data from a large state's assessments to model change in Rasch item difficulty (RID) as a function of item position change,…

Descriptors: Test Items, Test Content, Testing Programs, Simulation

Attitude Research in Science Education: Classic and Contemporary Measurements

Direct link

Saleh, Issa M., Ed.; Khine, Myint Swe, Ed. – IAP - Information Age Publishing, Inc., 2011

The research into how students' attitudes affect their learning of science related subjects has been one of the core areas of interest by science educators. The development in science education records various attempts in measuring attitudes and determining the correlations between behavior, achievements, career aspirations, gender identity and…

Descriptors: Foreign Countries, Indigenous Populations, Student Attitudes, Scientific Attitudes

Multidimensional Adaptive Testing in Educational and Psychological Measurement: Current State and Future Challenges

Peer reviewed

Direct link

Frey, Andreas; Seitz, Nicki-Nils – Studies in Educational Evaluation, 2009

The paper gives an overview of multidimensional adaptive testing (MAT) and evaluates its applicability in educational and psychological testing. The approach of Segall (1996) is described as a general framework for MAT. The main advantage of MAT is its capability to increase measurement efficiency. In simulation studies conceptualizing situations…

Descriptors: Psychological Testing, Adaptive Testing, Simulation, Evaluation Methods

A Comparison of Testlet-Based Test Designs for Computerized Adaptive Testing.

Download full text

Schnipke, Deborah L.; Reese, Lynda M. – 1997

Two-stage and multistage test designs provide a way of roughly adapting item difficulty to test-taker ability. All test takers take a parallel stage-one test, and, based on their scores, they are routed to tests of different difficulty levels in subsequent stages. These designs provide some of the benefits of standard computerized adaptive testing…

Descriptors: Ability, Adaptive Testing, Algorithms, Comparative Analysis

Some Considerations in Maintaining Adaptive Test Item Pools.

Download full text

Stocking, Martha L. – 1988

The construction of parallel editions of conventional tests for purposes of test security while maintaining score comparability has always been a recognized and difficult problem in psychometrics and test construction. The introduction of new modes of test construction, e.g., adaptive testing, changes the nature of the problem, but does not make…

Descriptors: Adaptive Testing, Comparative Analysis, Computer Assisted Testing, Identification

Previous Page | Next Page »

Pages: 1 | 2

Guo, Hongwen	2
Stocking, Martha L.	2
Antal, Judit	1
Babcock, Ben	1
Betz, Nancy E.	1
Browne, Matthew	1
Chen, Tzu-An	1
Conley, Patrick	1
Finkelman, Matthew	1
Frankel, Lois	1
Frey, Andreas	1
Glas, Cees A. W.	1
Jegerski, Jane	1
Khine, Myint Swe, Ed.	1
Kim, Wonsuk	1
Kyllonen, Patrick	1
Ling, Guangming	1
Makransky, Guido	1
Melican, Gerald J.	1
Meyers, Jason L.	1
Miller, G. Edward	1
Mills, Craig N.	1
Proctor, Thomas P.	1
Rawat, Vijay	1
More ▼