ERIC - Search Results

Publication Date

In 2025	1
Since 2024	1
Since 2021 (last 5 years)	4
Since 2016 (last 10 years)	6
Since 2006 (last 20 years)	7

Descriptor

Simulation	18
Test Construction	18
Test Reliability	18
Computer Assisted Testing	6
Test Items	6
Test Validity	6
Psychometrics	4
Test Format	4
Adaptive Testing	3
Comparative Analysis	3
Error of Measurement	3
Item Analysis	3
Item Response Theory	3
Measurement Techniques	3
Occupational Tests	3
Performance Tests	3
Ability	2
Bayesian Statistics	2
Cutting Scores	2
Difficulty Level	2
Interrater Reliability	2
Item Banks	2
Job Performance	2
Job Training	2
Mastery Tests	2
More ▼

Source

ETS Research Report Series	2
Academic Medicine	1
Education and Information…	1
European Journal of Education	1
Journal of Educational…	1
Journal of Occupational…	1
Measurement:…	1
Performance and Instruction	1
Psychometrika	1
Training in Business and…	1

Publication Type

Reports - Research	13
Journal Articles	10
Speeches/Meeting Papers	4
Tests/Questionnaires	2
Collected Works - General	1
Guides - Non-Classroom	1
Numerical/Quantitative Data	1
Reports - Descriptive	1
Reports - Evaluative	1

Education Level

Higher Education	1
Postsecondary Education	1

Audience

Location

Iran

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing 1 to 15 of 18 results Save | Export

Practical Considerations in Choosing an Anchor Test Form for Equating under the Random Groups Design

Peer reviewed

Direct link

Cui, Zhongmin; He, Yong – Measurement: Interdisciplinary Research and Perspectives, 2023

Careful considerations are necessary when there is a need to choose an anchor test form from a list of old test forms for equating under the random groups design. The choice of the anchor form potentially affects the accuracy of equated scores on new test forms. Few guidelines, however, can be found in the literature on choosing the anchor form.…

Descriptors: Test Format, Equated Scores, Best Practices, Test Construction

Capturing Competence: The Design, Evaluation, and Implementation of a Video-Based Instrument for Assessing Verbal Aggression Management Competence

Peer reviewed

Direct link

Delphine Franco; Ruben Vanderlinde; Martin Valcke – European Journal of Education, 2025

Complex competences, such as managing students' aggressive behaviour, are challenging to develop during teacher training. Recently, video-based simulations have been considered promising, yet suitable assessment instruments are limitedly available. This paper reports on the design and evaluation of a video-based assessment tool tailored to measure…

Descriptors: Preservice Teachers, Preservice Teacher Education, Student Behavior, Aggression

Using Existing Data to Inform Development of New Item Types. Research Report. ETS RR-20-01

Peer reviewed
PDF on ERIC

Download full text

Guo, Hongwen; Ling, Guangming; Frankel, Lois – ETS Research Report Series, 2020

With advances in technology, researchers and test developers are developing new item types to measure complex skills like problem solving and critical thinking. Analyzing such items is often challenging because of their complicated response patterns, and thus it is important to develop psychometric methods for practitioners and researchers to…

Descriptors: Test Construction, Test Items, Item Analysis, Psychometrics

Measuring Language Ability of Students with Compensatory Multidimensional CAT: A Post-Hoc Simulation Study

Peer reviewed

Direct link

Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022

The computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. The multidimensional CAT (MCAT) designs differ in terms of different item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…

Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency

Validation of Objective Structured Clinical Examination (OSCE) Based on the Occupational Therapy Practice Framework (OTPF): A Pilot Study

Peer reviewed
PDF on ERIC

Download full text

Marzieh Pashmdarfard; Afsoon Hassani Mehraban; Narges Shafaroodi; Kamran Soltani Arabshahi; Soroor Parvizy; Akram Azad; Samaneh Karamali Esmaeili – Journal of Occupational Therapy Education, 2022

Fieldwork education is an integral part of the educational process in occupational therapy and assessing student competency at the end of fieldwork is important. The aim of this study was to design and conduct an Objective Structured Clinical Examination (OSCE) based on the Occupational Therapy Practice Framework (OTPF) for occupational therapy…

Descriptors: Occupational Therapy, Allied Health Occupations Education, Test Construction, Test Validity

A Simulation-Based Method for Finding the Optimal Number of Options for Multiple-Choice Items on a Test. Research Report. ETS RR-18-22

Peer reviewed
PDF on ERIC

Download full text

Guo, Hongwen; Zu, Jiyun; Kyllonen, Patrick – ETS Research Report Series, 2018

For a multiple-choice test under development or redesign, it is important to choose the optimal number of options per item so that the test possesses the desired psychometric properties. On the basis of available data for a multiple-choice assessment with 8 options, we evaluated the effects of changing the number of options on test properties…

Descriptors: Multiple Choice Tests, Test Items, Simulation, Test Construction

A Procedure for Dimensionality Analyses of Response Data from Various Test Designs

Peer reviewed

Direct link

Zhang, Jinming – Psychometrika, 2013

In some popular test designs (including computerized adaptive testing and multistage testing), many item pairs are not administered to any test takers, which may result in some complications during dimensionality analyses. In this paper, a modified DETECT index is proposed in order to perform dimensionality analyses for response data from such…

Descriptors: Adaptive Testing, Simulation, Computer Assisted Testing, Test Reliability

A Monte Carlo Comparison of Ten Item Discrimination Indices.

Peer reviewed

Beuchert, A. Kent; Mendoza, Jorge L. – Journal of Educational Measurement, 1979

Ten item discrimination indices, across a variety of item analysis situations, were compared, based on the validities of tests constructed by using each of the indices to select 40 items from a 100-item pool. Item score data were generated by a computer program and included a simulation of guessing. (Author/CTM)

Descriptors: Item Analysis, Simulation, Statistical Analysis, Test Construction

Framework for Performance Testing

Osborn, William C. – Training in Business and Industry, 1974

A chart of the major action points in the course of developing tests for training evaluation and for qualifying trainees provides a framework for discussing the problems and practices of test development. Performance standards are the most common sources of trouble; the test developer must have unequivocal standards to work from. (AJ)

Descriptors: Diagnostic Tests, Employment Qualifications, Job Training, Performance Tests

Comprehensive Open Skill Test Design.

Desmedt, John; Yelon, Stephen – Performance and Instruction, 1992

Elementary performance tests and situational or simulation tests may be combined for comprehensive testing of open skills, i.e., a worker's competency in reacting to unpredictable situations. Elementary performance tests capture the professional skills whereas simulation tests retain realism and complexity and allow variation in responses.…

Descriptors: Achievement Tests, Guidelines, Industrial Training, Job Training

Status Report on the NBME's Computer-Based Testing.

Peer reviewed

Clyman, Stephen G.; Orr, Nancy A. – Academic Medicine, 1990

The process proposed for the development and use of computer-based testing, including simulation and multiple-choice questions, as part of the National Board of Medical Examiners' certification sequence is outlined. Summary reports of first-phase pilot testing in six medical schools are appended. (MSE)

Descriptors: Computer Assisted Testing, Higher Education, Licensing Examinations (Professions), Medical Education

Traditional In-Baskets vs. the General Management In-Basket (GMIB).

Download full text

Joines, Richard C. – 1991

The development and validation of the General Management In-Basket (GMIB) is described. The GMIB is a theory-based generic in-basket simulation, designed to assess supervisory and management skills independent of any job classification. Three of the 15 in-basket items in the GMIB are critical and are scored on a 0-5 scale. The remaining 12 items…

Descriptors: Administrator Evaluation, Concurrent Validity, Factor Analysis, Interrater Reliability

Effects of Test Length and Advancement Score on Several Criterion-Referenced Test Reliability and Validity Indices. Laboratory of Psychometric and Evaluation Research Report No. 86.

Download full text

Eignor, Daniel R.; Hambleton, Ronald K. – 1979

The purpose of the investigation was to obtain some relationships among (1) test lengths, (2) shape of domain-score distributions, (3) advancement scores, and (4) several criterion-referenced test score reliability and validity indices. The study was conducted using computer simulation methods. The values of variables under study were set to be…

Descriptors: Comparative Analysis, Computer Assisted Testing, Criterion Referenced Tests, Cutting Scores

Empirical and Simulation Studies of Flexilevel Ability Testing. Research Report No. 75-3.

Download full text

Betz, Nancy E.; Weiss, David J. – 1975

A 40-item flexilevel test and a 40-item conventional test were compared using data obtained through (1) computer-administration of the two tests to three groups of college students, and (2) monte carlo simulation of test response patterns. Results indicated the flexilevel score distribution better reflected the underlying normal distribution of…

Descriptors: Ability, College Students, Comparative Analysis, Computer Oriented Programs

Simulation Studies of Two-Stage Ability Testing. Research Report 74-4.

Download full text

Betz, Nancy E.; Weiss, David J. – 1974

Monte Carlo simulation procedures were used to study the psychometric characteristics of two two-stage adaptive tests and a conventional "peaked" ability test. Results showed that scores yielded by both two-stage tests better reflected the normal distribution of underlying ability. Ability estimates yielded by one of the two stage tests…

Descriptors: Ability, Academic Ability, Adaptive Testing, Computers

Previous Page | Next Page »

Pages: 1 | 2

Betz, Nancy E.	2
Guo, Hongwen	2
Weiss, David J.	2
Afsoon Hassani Mehraban	1
Akram Azad	1
Beuchert, A. Kent	1
Clyman, Stephen G.	1
Cui, Zhongmin	1
Delphine Franco	1
Desmedt, John	1
Eignor, Daniel R.	1
Frankel, Lois	1
Gelbal, Selahattin	1
Hambleton, Ronald K.	1
He, Yong	1
Huynh, Huynh	1
Joines, Richard C.	1
Jones, Michael H.	1
Kamran Soltani Arabshahi	1
Kyllonen, Patrick	1
Ling, Guangming	1
Martin Valcke	1
Marzieh Pashmdarfard	1
Mendoza, Jorge L.	1
More ▼