Publication Date
In 2025 | 1 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 4 |
Since 2016 (last 10 years) | 6 |
Since 2006 (last 20 years) | 7 |
Descriptor
Simulation | 18 |
Test Construction | 18 |
Test Reliability | 18 |
Computer Assisted Testing | 6 |
Test Items | 6 |
Test Validity | 6 |
Psychometrics | 4 |
Test Format | 4 |
Adaptive Testing | 3 |
Comparative Analysis | 3 |
Error of Measurement | 3 |
More ▼ |
Source
Author
Betz, Nancy E. | 2 |
Guo, Hongwen | 2 |
Weiss, David J. | 2 |
Afsoon Hassani Mehraban | 1 |
Akram Azad | 1 |
Beuchert, A. Kent | 1 |
Clyman, Stephen G. | 1 |
Cui, Zhongmin | 1 |
Delphine Franco | 1 |
Desmedt, John | 1 |
Eignor, Daniel R. | 1 |
More ▼ |
Publication Type
Education Level
Higher Education | 1 |
Postsecondary Education | 1 |
Audience
Location
Iran | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Practical Considerations in Choosing an Anchor Test Form for Equating under the Random Groups Design
Cui, Zhongmin; He, Yong – Measurement: Interdisciplinary Research and Perspectives, 2023
Careful considerations are necessary when there is a need to choose an anchor test form from a list of old test forms for equating under the random groups design. The choice of the anchor form potentially affects the accuracy of equated scores on new test forms. Few guidelines, however, can be found in the literature on choosing the anchor form.…
Descriptors: Test Format, Equated Scores, Best Practices, Test Construction
Delphine Franco; Ruben Vanderlinde; Martin Valcke – European Journal of Education, 2025
Complex competences, such as managing students' aggressive behaviour, are challenging to develop during teacher training. Recently, video-based simulations have been considered promising, yet suitable assessment instruments are limitedly available. This paper reports on the design and evaluation of a video-based assessment tool tailored to measure…
Descriptors: Preservice Teachers, Preservice Teacher Education, Student Behavior, Aggression
Guo, Hongwen; Ling, Guangming; Frankel, Lois – ETS Research Report Series, 2020
With advances in technology, researchers and test developers are developing new item types to measure complex skills like problem solving and critical thinking. Analyzing such items is often challenging because of their complicated response patterns, and thus it is important to develop psychometric methods for practitioners and researchers to…
Descriptors: Test Construction, Test Items, Item Analysis, Psychometrics
Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022
The computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. The multidimensional CAT (MCAT) designs differ in terms of different item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…
Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency
Marzieh Pashmdarfard; Afsoon Hassani Mehraban; Narges Shafaroodi; Kamran Soltani Arabshahi; Soroor Parvizy; Akram Azad; Samaneh Karamali Esmaeili – Journal of Occupational Therapy Education, 2022
Fieldwork education is an integral part of the educational process in occupational therapy and assessing student competency at the end of fieldwork is important. The aim of this study was to design and conduct an Objective Structured Clinical Examination (OSCE) based on the Occupational Therapy Practice Framework (OTPF) for occupational therapy…
Descriptors: Occupational Therapy, Allied Health Occupations Education, Test Construction, Test Validity
Guo, Hongwen; Zu, Jiyun; Kyllonen, Patrick – ETS Research Report Series, 2018
For a multiple-choice test under development or redesign, it is important to choose the optimal number of options per item so that the test possesses the desired psychometric properties. On the basis of available data for a multiple-choice assessment with 8 options, we evaluated the effects of changing the number of options on test properties…
Descriptors: Multiple Choice Tests, Test Items, Simulation, Test Construction
Zhang, Jinming – Psychometrika, 2013
In some popular test designs (including computerized adaptive testing and multistage testing), many item pairs are not administered to any test takers, which may result in some complications during dimensionality analyses. In this paper, a modified DETECT index is proposed in order to perform dimensionality analyses for response data from such…
Descriptors: Adaptive Testing, Simulation, Computer Assisted Testing, Test Reliability

Beuchert, A. Kent; Mendoza, Jorge L. – Journal of Educational Measurement, 1979
Ten item discrimination indices, across a variety of item analysis situations, were compared, based on the validities of tests constructed by using each of the indices to select 40 items from a 100-item pool. Item score data were generated by a computer program and included a simulation of guessing. (Author/CTM)
Descriptors: Item Analysis, Simulation, Statistical Analysis, Test Construction
Osborn, William C. – Training in Business and Industry, 1974
A chart of the major action points in the course of developing tests for training evaluation and for qualifying trainees provides a framework for discussing the problems and practices of test development. Performance standards are the most common sources of trouble; the test developer must have unequivocal standards to work from. (AJ)
Descriptors: Diagnostic Tests, Employment Qualifications, Job Training, Performance Tests
Desmedt, John; Yelon, Stephen – Performance and Instruction, 1992
Elementary performance tests and situational or simulation tests may be combined for comprehensive testing of open skills, i.e., a worker's competency in reacting to unpredictable situations. Elementary performance tests capture the professional skills whereas simulation tests retain realism and complexity and allow variation in responses.…
Descriptors: Achievement Tests, Guidelines, Industrial Training, Job Training

Clyman, Stephen G.; Orr, Nancy A. – Academic Medicine, 1990
The process proposed for the development and use of computer-based testing, including simulation and multiple-choice questions, as part of the National Board of Medical Examiners' certification sequence is outlined. Summary reports of first-phase pilot testing in six medical schools are appended. (MSE)
Descriptors: Computer Assisted Testing, Higher Education, Licensing Examinations (Professions), Medical Education
Joines, Richard C. – 1991
The development and validation of the General Management In-Basket (GMIB) is described. The GMIB is a theory-based generic in-basket simulation, designed to assess supervisory and management skills independent of any job classification. Three of the 15 in-basket items in the GMIB are critical and are scored on a 0-5 scale. The remaining 12 items…
Descriptors: Administrator Evaluation, Concurrent Validity, Factor Analysis, Interrater Reliability
Eignor, Daniel R.; Hambleton, Ronald K. – 1979
The purpose of the investigation was to obtain some relationships among (1) test lengths, (2) shape of domain-score distributions, (3) advancement scores, and (4) several criterion-referenced test score reliability and validity indices. The study was conducted using computer simulation methods. The values of variables under study were set to be…
Descriptors: Comparative Analysis, Computer Assisted Testing, Criterion Referenced Tests, Cutting Scores
Betz, Nancy E.; Weiss, David J. – 1975
A 40-item flexilevel test and a 40-item conventional test were compared using data obtained through (1) computer-administration of the two tests to three groups of college students, and (2) monte carlo simulation of test response patterns. Results indicated the flexilevel score distribution better reflected the underlying normal distribution of…
Descriptors: Ability, College Students, Comparative Analysis, Computer Oriented Programs
Betz, Nancy E.; Weiss, David J. – 1974
Monte Carlo simulation procedures were used to study the psychometric characteristics of two two-stage adaptive tests and a conventional "peaked" ability test. Results showed that scores yielded by both two-stage tests better reflected the normal distribution of underlying ability. Ability estimates yielded by one of the two stage tests…
Descriptors: Ability, Academic Ability, Adaptive Testing, Computers
Previous Page | Next Page ยป
Pages: 1 | 2