NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 1 to 15 of 17 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Daria Gerasimova – Journal of Educational Measurement, 2024
I propose two practical advances to the argument-based approach to validity: developing a living document and incorporating preregistration. First, I present a potential structure for the living document that includes an up-to-date summary of the validity argument. As the validation process may span across multiple studies, the living document…
Descriptors: Validity, Documentation, Methods, Research Reports
Peer reviewed Peer reviewed
Direct linkDirect link
Baldwin, Peter; Clauser, Brian E. – Journal of Educational Measurement, 2022
While score comparability across test forms typically relies on common (or randomly equivalent) examinees or items, innovations in item formats, test delivery, and efforts to extend the range of score interpretation may require a special data collection before examinees or items can be used in this way--or may be incompatible with common examinee…
Descriptors: Scoring, Testing, Test Items, Test Format
Peer reviewed Peer reviewed
Direct linkDirect link
Wind, Stefanie A.; Jones, Eli – Journal of Educational Measurement, 2019
Researchers have explored a variety of topics related to identifying and distinguishing among specific types of rater effects, as well as the implications of different types of incomplete data collection designs for rater-mediated assessments. In this study, we used simulated data to examine the sensitivity of latent trait model indicators of…
Descriptors: Rating Scales, Models, Evaluators, Data Collection
Peer reviewed Peer reviewed
Direct linkDirect link
Bradlow, Eric T. – Journal of Educational Measurement, 2013
The van der Linden article (this issue) provides a roadmap for future research in equating. My belief is that the roadmap begins and ends with collecting auxiliary data that can be utilized to provide improved equating, especially when data are sparse or equating beyond simple moments is desired.
Descriptors: Equated Scores, Data Collection, Statistical Analysis, Research
Peer reviewed Peer reviewed
Direct linkDirect link
Zhu, Mengxiao; Shu, Zhan; von Davier, Alina A. – Journal of Educational Measurement, 2016
New technology enables interactive and adaptive scenario-based tasks (SBTs) to be adopted in educational measurement. At the same time, it is a challenging problem to build appropriate psychometric models to analyze data collected from these tasks, due to the complexity of the data. This study focuses on process data collected from SBTs. We…
Descriptors: Measurement, Data Collection, National Competency Tests, Scoring Rubrics
Peer reviewed Peer reviewed
Direct linkDirect link
van der Linden, Wim J. – Journal of Educational Measurement, 2013
In spite of all of the technical progress in observed-score equating, several of the more conceptual aspects of the process still are not well understood. As a result, the equating literature struggles with rather complex criteria of equating, lack of a test-theoretic foundation, confusing terminology, and ad hoc analyses. A return to Lord's…
Descriptors: Equated Scores, Statistical Analysis, Computation, Data Collection
Peer reviewed Peer reviewed
Direct linkDirect link
Branberg, Kenny; Wiberg, Marie – Journal of Educational Measurement, 2011
This paper examined observed score linear equating in two different data collection designs, the equivalent groups design and the nonequivalent groups design, when information from covariates (i.e., background variables correlated with the test scores) was included. The main purpose of the study was to examine the effect (i.e., bias, variance, and…
Descriptors: Equated Scores, Data Collection, Models, Accuracy
Peer reviewed Peer reviewed
Subkoviak, Michael; Roecks, Alan L. – Journal of Educational Measurement, 1976
Three different methods of data collection were examined in which subjects judged proximity between object pairs. Significant differences in accuracy were found among the three methods, presumably due to differences in the extent to which subjects are able to describe their perceptions under the various methods. (Author/RC)
Descriptors: College Students, Data Collection, Distance, Geographic Location
Peer reviewed Peer reviewed
Subkoviak, Michael J.; Levin, Joel R. – Journal of Educational Measurement, 1974
A free-response method of data collection (questionnaires), in conjunction with nonmetric multidimensional scaling, produced results highly similar to those of a previous study, i.e., that an effective college teacher could be characterized in terms of "research,""teaching," and "service to the university." (Author/RC)
Descriptors: College Faculty, Data Collection, Evaluation Methods, Multidimensional Scaling
Peer reviewed Peer reviewed
Marco, Gary L.; And Others – Journal of Educational Measurement, 1976
Special emphasis is given to the kinds of control that can be exercised over initial status, including the use of proxy input data. A rationale for the classification scheme is developed, based on (1) three one-shot, one cross-sectional, and two longitudinal data types and (2) two types of referencing: criterion referencing and norm referencing.…
Descriptors: Classification, Data Collection, Evaluation Methods, Methods
Peer reviewed Peer reviewed
Rogers, W. Todd; And Others – Journal of Educational Measurement, 1977
The bias attributable to nonresponse in population estimates in the field of education was studied. Data were collected from responses to mathematics and science exercises administered by the National Assessment of Educational Progress to a probability sample of 17-year olds, as well as a probability sample selected from nonrespondents.…
Descriptors: Attrition (Research Studies), Data Collection, High Schools, National Surveys
Peer reviewed Peer reviewed
Bunda, Mary Anne – Journal of Educational Measurement, 1973
Procedures to be applicable in situations in which large numbers of individuals are tested or in situations where multiple measures are taken. (Author/CB)
Descriptors: Data Collection, Group Norms, Individual Testing, Item Sampling
Peer reviewed Peer reviewed
Scott, M. M.; Hatfield, James G. – Journal of Educational Measurement, 1985
Differences in agreement between observers and analysts of naturalistic narrative data cause problems in observation research. This paper discusses the advantages and disadvantages of several possible solutions. (Author/GDC)
Descriptors: Behavioral Science Research, Data Analysis, Data Collection, Interrater Reliability
Peer reviewed Peer reviewed
Mathews, Walter M. – Journal of Educational Measurement, 1973
This article reports a comparative study of teacher acceptance of two kinds of testing reports that were generated for Form A of the Iowa Tests of Basic Skills at the fourth-grade level. (Editor)
Descriptors: Academic Achievement, Academic Records, Data Collection, Elementary Schools
Peer reviewed Peer reviewed
Busch, John Christian; Jaeger, Richard M. – Journal of Educational Measurement, 1990
The effects of using recommended data collection procedures on median recommended test standards, variability of recommended test standards, and reliability of recommended standards for 7 subtests of the National Teacher Examinations Communications Skills and General Knowledge Tests were explored, using 236 evaluators (75 public school teachers…
Descriptors: College Faculty, Data Collection, Evaluators, Higher Education
Previous Page | Next Page ยป
Pages: 1  |  2