ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	2

Descriptor

Comparative Testing	11
Computer Simulation	11
Test Items	7
Item Response Theory	5
Mathematical Models	5
Estimation (Mathematics)	4
Scores	4
Adaptive Testing	3
Computer Assisted Testing	3
Equations (Mathematics)	3
Item Bias	3
Statistical Distributions	3
Test Length	3
Control Groups	2
Correlation	2
Error of Measurement	2
Experimental Groups	2
Foreign Countries	2
Item Banks	2
Licensing Examinations…	2
Monte Carlo Methods	2
Racial Differences	2
Research Methodology	2
Sample Size	2
Ability	1
More ▼

Source

Applied Psychological…	1
Journal of Educational…	1
Online Submission	1

Author

Bayrak, Bekir	1
Chang, Yu-Wen	1
Davison, Mark L.	1
De Ayala, R. J.	1
Du Bose, Pansy	1
Ingec, Sebnem Kandil	1
Kanli, Uygar	1
Kim, Haeok	1
Kluge, Annette	1
Kromrey, Jeffrey D.	1
Mazor, Kathleen M.	1
Miller, Timothy R.	1
Nandakumar, Ratna	1
Plake, Barbara S.	1
Spray, Judith A.	1
Sykes, Robert C.	1
Wainer, Howard	1
More ▼

Publication Type

Reports - Evaluative	11
Speeches/Meeting Papers	6
Journal Articles	3
Reports - Research	1
Tests/Questionnaires	1

Education Level

Grade 9

Audience

Location

Germany	1
Turkey	1

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…

What Works Clearinghouse Rating

Showing all 11 results Save | Export

Performance Assessments with Microworlds and Their Difficulty

Peer reviewed

Direct link

Kluge, Annette – Applied Psychological Measurement, 2008

The use of microworlds (MWs), or complex dynamic systems, in educational testing and personnel selection is hampered by systematic measurement errors because these new and innovative item formats are not adequately controlled for their difficulty. This empirical study introduces a way to operationalize an MW's difficulty and demonstrates the…

Descriptors: Personnel Selection, Self Efficacy, Educational Testing, Computer Uses in Education

To Compare the Effects of Computer Based Learning and the Laboratory Based Learning on Students' Achievement Regarding Electric Circuits

Download full text

Bayrak, Bekir; Kanli, Uygar; Ingec, Sebnem Kandil – Online Submission, 2007

In this study, the research problem was: "Is the computer based physics instruction as effective as laboratory intensive physics instruction with regards to academic success on electric circuits 9th grade students?" For this research of experimental quality the design of pre-test and post-test are applied with an experiment and a control…

Descriptors: Control Groups, Student Attitudes, Academic Achievement, Physics

Identification of Non-Uniform Differential Item Functioning Using a Variation of the Mantel-Haenszel Procedure.

Download full text

Mazor, Kathleen M.; And Others – 1993

The Mantel-Haenszel (MH) procedure has become one of the most popular procedures for detecting differential item functioning (DIF). One of the most troublesome criticisms of this procedure is that while detection rates for uniform DIF are very good, the procedure is not sensitive to non-uniform DIF. In this study, examinee responses were generated…

Descriptors: Comparative Testing, Computer Simulation, Item Bias, Item Response Theory

A Comparison of the Performance of Simulated Hierarchical and Linear Testlets.

Peer reviewed

Wainer, Howard; And Others – Journal of Educational Measurement, 1992

Computer simulations were run to measure the relationship between testlet validity and factors of item pool size and testlet length for both adaptive and linearly constructed testlets. Making a testlet adaptive yields only modest increases in aggregate validity because of the peakedness of the typical proficiency distribution. (Author/SLD)

Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Computer Simulation

A Comparison of Unidimensional and Multidimensional IRT Approaches to Test Information in a Test Battery.

Download full text

Chang, Yu-Wen; Davison, Mark L. – 1992

Standard errors and bias of unidimensional and multidimensional ability estimates were compared in a factorial, simulation design with two item response theory (IRT) approaches, two levels of test correlation (0.42 and 0.63), two sample sizes (500 and 1,000), and a hierarchical test content structure. Bias and standard errors of subtest scores…

Descriptors: Comparative Testing, Computer Simulation, Correlation, Error of Measurement

Assessing the Impact of Multidimensionality on the Classification Decisions of an IRT-Based Licensure Examination.

Download full text

Sykes, Robert C.; And Others – 1992

A part-form methodology was used to study the effect of varying degrees of multidimensionality on the consistency of pass/fail classification decisions obtained from simulated unidimensional item response theory (IRT) based licensure examinations. A control on the degree of form multidimensionality permitted an assessment throughout the range of…

Descriptors: Classification, Comparative Testing, Computer Simulation, Decision Making

The Nominal Response Model in Computerized Adaptive Testing.

Download full text

De Ayala, R. J. – 1992

One important and promising application of item response theory (IRT) is computerized adaptive testing (CAT). The implementation of a nominal response model-based CAT (NRCAT) was studied. Item pool characteristics for the NRCAT as well as the comparative performance of the NRCAT and a CAT based on the three-parameter logistic (3PL) model were…

Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Computer Simulation

Monte Carlo Simulation Comparison of Two-Stage Testing and Computerized Adaptive Testing.

Download full text

Kim, Haeok; Plake, Barbara S. – 1993

A two-stage testing strategy is one method of adapting the difficulty of a test to an individual's ability level in an effort to achieve more precise measurement. A routing test provides an initial estimate of ability level, and a second-stage measurement test then evaluates the examinee further. The measurement accuracy and efficiency of item…

Descriptors: Ability, Adaptive Testing, Comparative Testing, Computer Assisted Testing

Performance of the Mantel-Haenszel Statistic and the Standardized Difference in Proportions Correct When Population Ability Distributions Are Incongruent.

Download full text

Spray, Judith A.; Miller, Timothy R. – 1992

A popular method of analyzing test items for differential item functioning (DIF) is to compute a statistic that conditions samples of examinees from different populations on an estimate of ability. This conditioning or matching by ability is intended to produce an appropriate statistic that is sensitive to true differences in item functioning,…

Descriptors: Blacks, College Entrance Examinations, Comparative Testing, Computer Simulation

Simultaneous DIF Amplification and Cancellation: Shealy-Stout's Test for DIF.

Download full text

Nandakumar, Ratna – 1992

The phenomenon of simultaneous differential item functioning (DIF) amplification and cancellation and the role of the SIBTEST computer program in detecting it were studied. A variety of simulated test data was generated for this purpose. In addition, the following real test data were used: (1) American College Testing program data for 2,115 males…

Descriptors: Black Students, Comparative Testing, Computer Simulation, Computer Software

An Empirical Investigation of Equating Stability in a Single and a Double Linkage Design with Small Sample Sizes Using Angoff Model IV.

Download full text

Du Bose, Pansy; Kromrey, Jeffrey D. – 1993

Empirical evidence is presented of the relative efficiency of two potential linkage plans to be used when equivalent test forms are being administered. Equating is a process by which scores on one form of a test are converted to scores on another form of the same test. A Monte Carlo study was conducted to examine equating stability and statistical…

Descriptors: Art Education, Comparative Testing, Computer Simulation, Equated Scores