Shin, Jinnie; Gierl, Mark J. – International Journal of Testing, 2022
Over the last five years, tremendous strides have been made in advancing the AIG methodology required to produce items in diverse content areas. However, the one content area where enormous problems remain unsolved is language arts, generally, and reading comprehension, more specifically. While reading comprehension test items can be created using…
Descriptors: Reading Comprehension, Test Construction, Test Items, Natural Language Processing
Gierl, Mark J.; Bulut, Okan; Guo, Qi; Zhang, Xinxin – Review of Educational Research, 2017
Multiple-choice testing is considered one of the most effective and enduring forms of educational assessment that remains in practice today. This study presents a comprehensive review of the literature on multiple-choice testing in education focused, specifically, on the development, analysis, and use of the incorrect options, which are also…
Descriptors: Multiple Choice Tests, Difficulty Level, Accuracy, Error Patterns
Gierl, Mark J.; Lai, Hollis; Hogan, James B.; Matovinovic, Donna – Journal of Applied Testing Technology, 2015
The demand for test items far outstrips the current supply. This increased demand can be attributed, in part, to the transition to computerized testing, but it is also linked to dramatic changes in how 21st century educational assessments are designed and administered. One way to address this growing demand is with automatic item generation.…
Descriptors: Common Core State Standards, Test Items, Alignment (Education), Test Construction
Gierl, Mark J.; Lai, Hollis – Educational Measurement: Issues and Practice, 2016
Testing organizations need large numbers of high-quality items due to the proliferation of alternative test administration methods and modern test designs. But the current demand for items far exceeds the supply. Test items, as they are currently written, evoke a process that is both time-consuming and expensive because each item is written,…
Descriptors: Test Items, Test Construction, Psychometrics, Models
Gierl, Mark J.; Lai, Hollis; Pugh, Debra; Touchie, Claire; Boulais, André-Philippe; De Champlain, André – Applied Measurement in Education, 2016
Item development is a time- and resource-intensive process. Automatic item generation integrates cognitive modeling with computer technology to systematically generate test items. To date, however, items generated using cognitive modeling procedures have received limited use in operational testing situations. As a result, the psychometric…
Descriptors: Psychometrics, Multiple Choice Tests, Test Items, Item Analysis
Gierl, Mark J.; Lai, Hollis – International Journal of Testing, 2012
Automatic item generation represents a relatively new but rapidly evolving research area where cognitive and psychometric theories are used to produce tests that include items generated using computer technology. Automatic item generation requires two steps. First, test development specialists create item models, which are comparable to templates…
Descriptors: Foreign Countries, Psychometrics, Test Construction, Test Items
Gierl, Mark J.; Zhou, Jiawen; Alves, Cecilia – Journal of Technology, Learning, and Assessment, 2008
An item model serves as an explicit representation of the variables in an assessment task. An item model includes the "stem", "options", and "auxiliary information". The "stem" is the part of an item which formulates context, content, and/or the question the examinee is required to answer. The "options" contain the alternative answers with one…
Descriptors: Classification, Test Items, Models, Test Construction
Gierl, Mark J.; Alves, Cecilia; Majeau, Renate Taylor – International Journal of Testing, 2010
The purpose of this study is to apply the attribute hierarchy method in an operational diagnostic mathematics program at Grades 3 and 6 to promote cognitive inferences about students' problem-solving skills. The attribute hierarchy method is a psychometric procedure for classifying examinees' test item responses into a set of structured attribute…
Descriptors: Test Items, Student Reaction, Diagnostic Tests, Psychometrics
Gierl, Mark J.; Cui, Ying; Zhou, Jiawen – Journal of Educational Measurement, 2009
The attribute hierarchy method (AHM) is a psychometric procedure for classifying examinees' test item responses into a set of structured attribute patterns associated with different components from a cognitive model of task performance. Results from an AHM analysis yield information on examinees' cognitive strengths and weaknesses. Hence, the AHM…
Descriptors: Test Items, True Scores, Psychometrics, Algebra
Gierl, Mark J.; Cui, Ying – Measurement: Interdisciplinary Research and Perspectives, 2008
One promising application of diagnostic classification models (DCM) is in the area of cognitive diagnostic assessment in education. However, the successful application of DCM in educational testing will likely come with a price--and this price may be in the form of new test development procedures and practices required to yield data that satisfy…
Descriptors: Educational Testing, Classification, Psychometrics, Test Construction

Gierl, Mark J.; Leighton, Jacqueline P.; Hunka, Stephen M. – Educational Measurement: Issues and Practice, 2000
Discusses the logic of the rule-space model (K. Tatsuoka, 1983) as it applies to test development and analysis. The rule-space model is a statistical method for classifying examinees' test item responses into a set of attribute-mastery patterns associated with different cognitive skills. Directs readers to a tutorial that may be downloaded. (SLD)
Descriptors: Item Analysis, Item Response Theory, Test Construction, Test Items

Gierl, Mark J.; Henderson, Diane; Jodoin, Michael; Klinger, Don – Journal of Experimental Education, 2001
Examined the influence of item parameter estimation errors across three item selection methods using the two- and three-parameter logistic item response theory (IRT) models. Tests created with the maximum no target and maximum target item selection procedures consistently overestimated the test information function. Tests created using the theta…
Descriptors: Estimation (Mathematics), Item Response Theory, Selection, Test Construction

Gierl, Mark J. – Applied Psychological Measurement, 1998
Reviews a book documenting the research, development, and implementation efforts that allowed the U.S. Department of Defense to initiate the Computerized Adaptive Testing Armed Services Vocational Aptitude Battery Program for enlistment testing. The book traces the history of this program over 30 years. (SLD)
Descriptors: Adaptive Testing, Aptitude Tests, Armed Forces, Computer Assisted Testing

Gierl, Mark J. – Journal of Educational Research, 1997
Investigated whether Bloom's taxonomy offers item writers an accurate model for anticipating students' cognitive processes used to solve items on a large-scale mathematics achievement test. Seventh graders thought aloud as they solved problems on the test. Researchers coded their cognitive processes using Bloom's taxonomy. Results suggest that…
Descriptors: Cognitive Processes, Grade 7, Junior High Schools, Mathematics Education
Ercikan, Kadriye; Gierl, Mark J.; McCreith, Tanya; Puhan, Gautam; Koh, Kim – Applied Measurement in Education, 2004
This research examined the degree of comparability and sources of incomparability of English and French versions of reading, mathematics, and science tests that were administered as part of a survey of achievement in Canada. The results point to substantial psychometric differences between the 2 language versions. Approximately 18% to 36% of the…
Descriptors: Foreign Countries, Psychometrics, Science Tests, French