Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 6 |
Since 2006 (last 20 years) | 6 |
Descriptor
Statistical Analysis | 27 |
Test Interpretation | 27 |
Test Items | 27 |
Test Construction | 11 |
Item Analysis | 10 |
Scores | 8 |
Item Response Theory | 7 |
Latent Trait Theory | 7 |
Mathematical Models | 7 |
Difficulty Level | 6 |
Equated Scores | 5 |
More ▼ |
Source
Author
Publication Type
Education Level
Secondary Education | 3 |
Elementary Education | 1 |
Elementary Secondary Education | 1 |
Higher Education | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Postsecondary Education | 1 |
Audience
Researchers | 1 |
Location
Alabama | 1 |
Indiana | 1 |
Kansas | 1 |
Massachusetts | 1 |
Michigan | 1 |
Minnesota | 1 |
Netherlands | 1 |
New Jersey | 1 |
Ohio | 1 |
Oregon | 1 |
United Kingdom (England) | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
Armed Services Vocational… | 1 |
National Assessment of… | 1 |
Rokeach Value Survey | 1 |
SAT (College Admission Test) | 1 |
Strong Campbell Interest… | 1 |
Test of English as a Foreign… | 1 |
Trends in International… | 1 |
What Works Clearinghouse Rating
Sinharay, Sandip – Grantee Submission, 2018
Tatsuoka (1984) suggested several extended caution indices and their standardized versions that have been used as person-fit statistics by researchers such as Drasgow, Levine, and McLaughlin (1987), Glas and Meijer (2003), and Molenaar and Hoijtink (1990). However, these indices are only defined for tests with dichotomous items. This paper extends…
Descriptors: Test Format, Goodness of Fit, Item Response Theory, Error Patterns
Vaheoja, Monika; Verhelst, N. D.; Eggen, T.J.H.M. – European Journal of Science and Mathematics Education, 2019
In this article, the authors applied profile analysis to Maths exam data to demonstrate how different exam forms, differing in difficulty and length, can be reported and easily interpreted. The results were presented for different groups of participants and for different institutions in different Maths domains by evaluating the balance. Some…
Descriptors: Feedback (Response), Foreign Countries, Statistical Analysis, Scores
Hidalgo, Ma Dolores; Benítez, Isabel; Padilla, Jose-Luis; Gómez-Benito, Juana – Sociological Methods & Research, 2017
The growing use of scales in survey questionnaires warrants the need to address how does polytomous differential item functioning (DIF) affect observed scale score comparisons. The aim of this study is to investigate the impact of DIF on the type I error and effect size of the independent samples t-test on the observed total scale scores. A…
Descriptors: Test Items, Test Bias, Item Response Theory, Surveys
He, Qingping; Stockford, Ian; Meadows, Michelle – Oxford Review of Education, 2018
Results from Rasch analysis of GCSE and GCE A level data over a period of four years suggest that the standards of examinations in different subjects are not consistent in terms of the levels of the latent trait specified in the Rasch model required to achieve the same grades. Variability in statistical standards between subjects exists at both…
Descriptors: Foreign Countries, Exit Examinations, Intellectual Disciplines, Item Response Theory
Thummaphan, Phonraphee – ProQuest LLC, 2017
The present study aimed to represent the innovative assessments that support students' learning in STEM education through using the integrative framework for Cognitive Diagnostic Modeling (CDM). This framework is based on three components, cognition, observation, and interpretation (National Research Council, 2001). Specifically, this dissertation…
Descriptors: STEM Education, Cognitive Processes, Observation, Psychometrics
Traynor, Anne – Educational Assessment, 2017
Variation in test performance among examinees from different regions or national jurisdictions is often partially attributed to differences in the degree of content correspondence between local school or training program curricula, and the test of interest. This posited relationship between test-curriculum correspondence, or "alignment,"…
Descriptors: Test Items, Test Construction, Alignment (Education), Curriculum

Clauser, Brian E.; Mazor, Kathleen M. – Educational Measurement: Issues and Practice, 1998
This module prepares the reader to use statistical procedures to detect differentially functioning test items. The Mantel-Haenszel statistic, logistic regression, the SIBTEST procedure, the Standardization procedure, and various item response theory-based procedures are presented. Theoretical frameworks, strengths and weaknesses, and…
Descriptors: Item Bias, Item Response Theory, Statistical Analysis, Teaching Methods

Kuncel, Ruth Boutin – Educational and Psychological Measurement, 1977
The interaction of subjects with test items is investigated. It is suggested that psychometricians rely too heavily on inferences about the nature of the interaction. An approach to data analysis is proposed which is more directly related to this interaction. (JKS)
Descriptors: Higher Education, Item Analysis, Latent Trait Theory, Psychometrics

van der Linden, Wim J. – Applied Psychological Measurement, 1979
The restrictions on item difficulties that must be met when binomial models are applied to domain-referenced testing are examined. Both a deterministic and a stochastic conception of item responses are discussed with respect to difficulty and Guttman-type items. (Author/BH)
Descriptors: Difficulty Level, Item Sampling, Latent Trait Theory, Mathematical Models
McKinley, Robert L.; Reckase, Mark D. – 1982
Several special cases of the general Rasch model, varying in complexity, were investigated to determine whether they could successfully model realistic multidimensional item response data. Whether the parameters of the model could be readily interpreted was also investigated. The models investigated included: (1) the vector model; (2) the product…
Descriptors: Goodness of Fit, Item Analysis, Latent Trait Theory, Mathematical Models
Willingness to Answer Multiple-Choice Questions as Manifested Both in Genuine and in Nonsense Items.

Frary, Robert B.; Hutchinson, T.P. – Educational and Psychological Measurement, 1982
Alternate versions of Hutchinson's theory were compared, and one which implies the existence of partial knowledge was found to be better than one which implies that an appropriate measure of ability is obtained by applying the conventional correction for guessing. (Author/PN)
Descriptors: Guessing (Tests), Latent Trait Theory, Multiple Choice Tests, Scoring Formulas
McCall, Chester H., Jr.; Gardner, Suzanne – 1984
The Research Services of the National Education Association (NEA) conducted a nationwide teacher opinion poll (TOP) based upon a stratified disproportionate two-state cluster sample of classroom teachers. This research study was conducted to test the hypothesis that the order of presentation of items would make no difference in the conclusions…
Descriptors: Attitude Measures, Elementary Secondary Education, National Surveys, Statistical Analysis

Downing, Steven M.; Haladyna, Thomas M. – Applied Measurement in Education, 1997
An ideal process is outlined for test item development and the study of item responses to ensure that tests are sound. Qualitative and quantitative methods are used to assess the item-level validity evidence for high-stakes examinations. A checklist for assessment is provided. (SLD)
Descriptors: High Stakes Tests, Item Response Theory, Qualitative Research, Quality Control
Dimitrov, Dimiter M. – 1994
An approach is described that reveals the hierarchical test structure (HTS) based on the cognitive demands of the test items, and conducts a linear trait modeling by using the HST elements as item difficulty components. This approach, referred to as the Hierarchical Latent Trait Approach (HLTA), employs an algorithm that allows all test items to…
Descriptors: Algorithms, Cognitive Processes, Difficulty Level, Higher Education
Rogers, H. Jane; Hambleton, Ronald K. – 1987
Although item bias statistics are widely recommended for use in test development and test analysis work, problems arise in their interpretation. The purpose of the present research was to evaluate the validity of logistic test models and computer simulation methods for providing a frame of reference for item bias statistic interpretations.…
Descriptors: Computer Simulation, Evaluation Methods, Item Analysis, Latent Trait Theory
Previous Page | Next Page »
Pages: 1 | 2