Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 3 |
Since 2006 (last 20 years) | 7 |
Descriptor
Statistical Analysis | 7 |
Test Format | 7 |
Equated Scores | 3 |
Item Response Theory | 3 |
Test Items | 3 |
Differences | 2 |
Educational Testing | 2 |
Evaluation Criteria | 2 |
Simulation | 2 |
Test Construction | 2 |
Alternative Assessment | 1 |
More ▼ |
Source
ProQuest LLC | 7 |
Author
Duong, Minh Quang | 1 |
Jiajing Huang | 1 |
Joseph, Dane Christian | 1 |
Murphy, Peter V. | 1 |
Rawlusyk, Patricia | 1 |
Tian, Feng | 1 |
Tingir, Seyfullah | 1 |
Publication Type
Dissertations/Theses -… | 7 |
Education Level
Higher Education | 1 |
Audience
Location
Florida | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Test Anxiety Inventory | 1 |
What Works Clearinghouse Rating
Jiajing Huang – ProQuest LLC, 2022
The nonequivalent-groups anchor-test (NEAT) data-collection design is commonly used in large-scale assessments. Under this design, different test groups take different test forms. Each test form has its own unique items and all test forms share a set of common items. If item response theory (IRT) models are applied to analyze the test data, the…
Descriptors: Item Response Theory, Test Format, Test Items, Test Construction
Tingir, Seyfullah – ProQuest LLC, 2019
Educators use various statistical techniques to explain relationships between latent and observable variables. One way to model these relationships is to use Bayesian networks as a scoring model. However, adjusting the conditional probability tables (CPT-parameters) to fit a set of observations is still a challenge when using Bayesian networks. A…
Descriptors: Bayesian Statistics, Statistical Analysis, Scoring, Probability
Rawlusyk, Patricia – ProQuest LLC, 2016
Assessment is a fundamental element in successful teaching and learning. However, few research studies have examined the assessment practices implemented by faculty in higher education. It is believed that testing has become the primary method of assessment, which could adversely impact student learning. The purpose of this descriptive…
Descriptors: Educational Assessment, Evaluation Criteria, Evaluation Methods, College Faculty
Murphy, Peter V. – ProQuest LLC, 2014
The emergence of standards-based curriculums has resulted in an increased frequency of student testing, including high-stakes testing. Of students who take tests, up to 65% may experience test anxiety, which can have negative effects on student outcomes. For this reason, the purpose of this single-group, repeated measures design, quantitative…
Descriptors: Test Anxiety, Statistical Analysis, Layout (Publications), Test Format
Duong, Minh Quang – ProQuest LLC, 2011
Testing programs often use multiple test forms of the same test to control item exposure and to ensure test security. Although test forms are constructed to be as similar as possible, they often differ. Test equating techniques are those statistical methods used to adjust scores obtained on different test forms of the same test so that they are…
Descriptors: Equated Scores, Statistical Analysis, Item Response Theory, Evaluation Criteria
Tian, Feng – ProQuest LLC, 2011
There has been a steady increase in the use of mixed-format tests, that is, tests consisting of both multiple-choice items and constructed-response items in both classroom and large-scale assessments. This calls for appropriate equating methods for such tests. As Item Response Theory (IRT) has rapidly become mainstream as the theoretical basis for…
Descriptors: Item Response Theory, Comparative Analysis, Equated Scores, Statistical Analysis
Joseph, Dane Christian – ProQuest LLC, 2010
Multiple-choice item-writing guideline research is in its infancy. Haladyna (2004) calls for a science of item-writing guideline research. The purpose of this study is to respond to such a call. The purpose of this study was to examine the impact of student ability and method for varying the location of correct answers in classroom multiple-choice…
Descriptors: Evidence, Test Format, Guessing (Tests), Program Effectiveness