Showing 1 to 15 of 17 results
Peer reviewed
Direct link
Xuelan Qiu; Jimmy de la Torre; You-Gan Wang; Jinran Wu – Educational Measurement: Issues and Practice, 2024
Multidimensional forced-choice (MFC) items have been found useful for reducing response biases in personality assessments. However, conventional scoring methods for MFC items result in ipsative data, hindering wider application of the MFC format. In the last decade, a number of item response theory (IRT) models have been developed,…
Descriptors: Item Response Theory, Personality Traits, Personality Measures, Personality Assessment
Peer reviewed
Direct link
Wildy, Helen; Styles, Irene – Australian Journal of Early Childhood, 2008
This paper reports analysis of 2006-2007 on-entry assessment data from the Performance Indicators in Primary Schools Baseline Assessment (PIPS-BLA) of random samples of students in England, Scotland, New Zealand and Australia. The analysis aimed, first, to investigate the validity and reliability of that instrument across countries and sexes, and,…
Descriptors: National Competency Tests, Foreign Countries, Student Evaluation, Comparative Education
Walt, Nancy; Atwood, Kristin; Mann, Alex – Journal of Technology, Learning, and Assessment, 2008
The purpose of this study was to determine whether survey medium (electronic versus paper format) has a significant effect on the results achieved. To compare survey media, responses from elementary students to British Columbia's Satisfaction Survey were analyzed. Although this study was not experimental in design, the data set served as a…
Descriptors: Student Attitudes, Factor Analysis, Foreign Countries, Elementary School Students
Kim, Do-Hong; Huynh, Huynh – Journal of Technology, Learning, and Assessment, 2007
This study examined comparability of student scores obtained from computerized and paper-and-pencil formats of the large-scale statewide end-of-course (EOC) examinations in the two subject areas of Algebra and Biology. Evidence in support of comparability of computerized and paper-based tests was sought by examining scale scores, item parameter…
Descriptors: Computer Assisted Testing, Measures (Individuals), Biology, Algebra
Peer reviewed
Direct link
Meijer, Rob R. – Journal of Educational Measurement, 2004
Two new methods are proposed to detect unexpected sum scores on subtests (testlets), both for paper-and-pencil tests and computer adaptive tests. A method based on a conservative bound using the hypergeometric distribution, denoted p, was compared with a method where the probability for each score combination was calculated using a…
Descriptors: Probability, Adaptive Testing, Item Response Theory, Scores
Peer reviewed
Direct link
van der Linden, Wim J. – Applied Psychological Measurement, 2006
Two local methods for observed-score equating are applied to the problem of equating an adaptive test to a linear test. In an empirical study, the methods were evaluated against a method based on the test characteristic function (TCF) of the linear test and traditional equipercentile equating applied to the ability estimates on the adaptive test…
Descriptors: Adaptive Testing, Computer Assisted Testing, Test Format, Equated Scores
Peer reviewed
Gitomer, Drew H.; Yamamoto, Kentaro – Journal of Educational Measurement, 1991
A model integrating latent trait and latent class theories in characterizing individual performance on the basis of qualitative understanding is presented. This HYBRID model is illustrated through experiments with 119 Air Force technicians taking a paper-and-pencil test and 136 Air Force technicians taking a computerized test. (SLD)
Descriptors: Comparative Testing, Computer Assisted Testing, Educational Assessment, Item Response Theory
Peer reviewed
De Ayala, R. J. – Applied Psychological Measurement, 1992
A computerized adaptive test (CAT) based on the nominal response model (NR CAT) was implemented, and the performance of the NR CAT was compared with that of a CAT based on the three-parameter logistic model. The NR CAT produced trait estimates comparable to those of the three-parameter test. (SLD)
Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Equations (Mathematics)
Sykes, Robert C.; And Others – 1991
To investigate the psychometric feasibility of replacing a paper-and-pencil licensing examination with a computer-administered test, a validity study was conducted. The computer-administered test (Cadm) presented a common set of items to all test takers, distinct from computerized adaptive testing, in which test takers receive items appropriate to…
Descriptors: Adults, Certification, Comparative Testing, Computer Assisted Testing
Wise, Steven L.; And Others – 1991
According to item response theory (IRT), examinee ability estimation is independent of the particular set of test items administered from a calibrated pool. Although the most popular application of this feature of IRT is computerized adaptive (CA) testing, a recently proposed alternative is self-adapted (SA) testing, in which examinees choose the…
Descriptors: Ability Identification, Adaptive Testing, College Students, Comparative Testing
De Ayala, R. J. – 1992
One important and promising application of item response theory (IRT) is computerized adaptive testing (CAT). The implementation of a nominal response model-based CAT (NRCAT) was studied. Item pool characteristics for the NRCAT as well as the comparative performance of the NRCAT and a CAT based on the three-parameter logistic (3PL) model were…
Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Computer Simulation
Kim, Haeok; Plake, Barbara S. – 1993
A two-stage testing strategy is one method of adapting the difficulty of a test to an individual's ability level in an effort to achieve more precise measurement. A routing test provides an initial estimate of ability level, and a second-stage measurement test then evaluates the examinee further. The measurement accuracy and efficiency of item…
Descriptors: Ability, Adaptive Testing, Comparative Testing, Computer Assisted Testing
Lunz, Mary E.; And Others – 1991
This paper explores the effect of reviewing items and altering responses on the efficiency of computer adaptive tests (CATs) and the resultant ability measures of examinees. Subjects included 712 medical students: 220 subjects were randomly assigned to the review condition; 492 were randomly assigned to a review control condition (the usual CAT…
Descriptors: Academic Ability, Adaptive Testing, Certification, Comparative Testing
Peer reviewed
Stone, Clement A.; Lane, Suzanne – Applied Measurement in Education, 1991
A model-testing approach for evaluating the stability of item response theory item parameter estimates (IPEs) in a pretest-posttest design is illustrated. Nineteen items from the Head Start Measures Battery were used. A moderately high degree of stability in the IPEs for 5,510 children assessed on 2 occasions was found. (TJH)
Descriptors: Comparative Testing, Compensatory Education, Computer Assisted Testing, Early Childhood Education
Lunz, Mary E.; And Others – 1990
This study explores the test-retest consistency of computer adaptive tests of varying lengths. The testing model was designed as a mastery model to determine whether an examinee's estimated ability level is above or below a pre-established criterion expressed in the metric (logits) of the calibrated item pool scale. The Rasch model was used…
Descriptors: Ability Identification, Adaptive Testing, College Students, Comparative Testing