ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	4
Since 2006 (last 20 years)	7

Descriptor

Data Collection	17
Statistical Analysis	5
Equated Scores	4
Comparative Analysis	3
Evaluation Methods	3
Methods	3
Models	3
Academic Achievement	2
Achievement Tests	2
College Faculty	2
Computation	2
Evaluators	2
High Schools	2
Multidimensional Scaling	2
National Surveys	2
Research Methodology	2
Sampling	2
Scores	2
Student Evaluation	2
Student Records	2
Tables (Data)	2
Test Interpretation	2
Test Items	2
Test Theory	2
Academic Records	1
More ▼

Source

Journal of Educational…

Publication Type

Journal Articles	11
Reports - Research	5
Reports - Evaluative	3
Reports - Descriptive	2
Opinion Papers	1

Education Level

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…	3
Iowa Tests of Basic Skills	1
National Teacher Examinations	1

What Works Clearinghouse Rating

Showing 1 to 15 of 17 results Save | Export

Argument-Based Approach to Validity: Developing a Living Document and Incorporating Preregistration

Peer reviewed

Direct link

Daria Gerasimova – Journal of Educational Measurement, 2024

I propose two practical advances to the argument-based approach to validity: developing a living document and incorporating preregistration. First, I present a potential structure for the living document that includes an up-to-date summary of the validity argument. As the validation process may span across multiple studies, the living document…

Descriptors: Validity, Documentation, Methods, Research Reports

Historical Perspectives on Score Comparability Issues Raised by Innovations in Testing

Peer reviewed

Direct link

Baldwin, Peter; Clauser, Brian E. – Journal of Educational Measurement, 2022

While score comparability across test forms typically relies on common (or randomly equivalent) examinees or items, innovations in item formats, test delivery, and efforts to extend the range of score interpretation may require a special data collection before examinees or items can be used in this way--or may be incompatible with common examinee…

Descriptors: Scoring, Testing, Test Items, Test Format

The Effects of Incomplete Rating Designs in Combination with Rater Effects

Peer reviewed

Direct link

Wind, Stefanie A.; Jones, Eli – Journal of Educational Measurement, 2019

Researchers have explored a variety of topics related to identifying and distinguishing among specific types of rater effects, as well as the implications of different types of incomplete data collection designs for rater-mediated assessments. In this study, we used simulated data to examine the sensitivity of latent trait model indicators of…

Descriptors: Rating Scales, Models, Evaluators, Data Collection

Comments on "Some Conceptual Issues in Observed-Score Equating" by Wim J. van der Linden

Peer reviewed

Direct link

Bradlow, Eric T. – Journal of Educational Measurement, 2013

The van der Linden article (this issue) provides a roadmap for future research in equating. My belief is that the roadmap begins and ends with collecting auxiliary data that can be utilized to provide improved equating, especially when data are sparse or equating beyond simple moments is desired.

Descriptors: Equated Scores, Data Collection, Statistical Analysis, Research

Using Networks to Visualize and Analyze Process Data for Educational Assessment

Peer reviewed

Direct link

Zhu, Mengxiao; Shu, Zhan; von Davier, Alina A. – Journal of Educational Measurement, 2016

New technology enables interactive and adaptive scenario-based tasks (SBTs) to be adopted in educational measurement. At the same time, it is a challenging problem to build appropriate psychometric models to analyze data collected from these tasks, due to the complexity of the data. This study focuses on process data collected from SBTs. We…

Descriptors: Measurement, Data Collection, National Competency Tests, Scoring Rubrics

Some Conceptual Issues in Observed-Score Equating

Peer reviewed

Direct link

van der Linden, Wim J. – Journal of Educational Measurement, 2013

In spite of all of the technical progress in observed-score equating, several of the more conceptual aspects of the process still are not well understood. As a result, the equating literature struggles with rather complex criteria of equating, lack of a test-theoretic foundation, confusing terminology, and ad hoc analyses. A return to Lord's…

Descriptors: Equated Scores, Statistical Analysis, Computation, Data Collection

Observed Score Linear Equating with Covariates

Peer reviewed

Direct link

Branberg, Kenny; Wiberg, Marie – Journal of Educational Measurement, 2011

This paper examined observed score linear equating in two different data collection designs, the equivalent groups design and the nonequivalent groups design, when information from covariates (i.e., background variables correlated with the test scores) was included. The main purpose of the study was to examine the effect (i.e., bias, variance, and…

Descriptors: Equated Scores, Data Collection, Models, Accuracy

A Closer Look at the Accuracy of Alternative Data-Collection Methods for Multidimensional Scaling

Peer reviewed

Subkoviak, Michael; Roecks, Alan L. – Journal of Educational Measurement, 1976

Three different methods of data collection were examined in which subjects judged proximity between object pairs. Significant differences in accuracy were found among the three methods, presumably due to differences in the extent to which subjects are able to describe their perceptions under the various methods. (Author/RC)

Descriptors: College Students, Data Collection, Distance, Geographic Location

Determining the Characteristics of the Ideal Professor: An Alternative Approach

Peer reviewed

Subkoviak, Michael J.; Levin, Joel R. – Journal of Educational Measurement, 1974

A free-response method of data collection (questionnaires), in conjunction with nonmetric multidimensional scaling, produced results highly similar to those of a previous study, i.e., that an effective college teacher could be characterized in terms of "research,""teaching," and "service to the university." (Author/RC)

Descriptors: College Faculty, Data Collection, Evaluation Methods, Multidimensional Scaling

A Classification Scheme for Methods of Using Student Data to Assess School Effectiveness

Peer reviewed

Marco, Gary L.; And Others – Journal of Educational Measurement, 1976

Special emphasis is given to the kinds of control that can be exercised over initial status, including the use of proxy input data. A rationale for the classification scheme is developed, based on (1) three one-shot, one cross-sectional, and two longitudinal data types and (2) two types of referencing: criterion referencing and norm referencing.…

Descriptors: Classification, Data Collection, Evaluation Methods, Methods

Assessment of Nonresponse Bias in Sample Surveys: An Example from National Assessment

Peer reviewed

Rogers, W. Todd; And Others – Journal of Educational Measurement, 1977

The bias attributable to nonresponse in population estimates in the field of education was studied. Data were collected from responses to mathematics and science exercises administered by the National Assessment of Educational Progress to a probability sample of 17-year olds, as well as a probability sample selected from nonrespondents.…

Descriptors: Attrition (Research Studies), Data Collection, High Schools, National Surveys

An Investigation of an Extension of Item Sampling Which Yields Individual Scores

Peer reviewed

Bunda, Mary Anne – Journal of Educational Measurement, 1973

Procedures to be applicable in situations in which large numbers of individuals are tested or in situations where multiple measures are taken. (Author/CB)

Descriptors: Data Collection, Group Norms, Individual Testing, Item Sampling

Problems of Analyst and Observer Agreement in Naturalistic Narrative Data.

Peer reviewed

Scott, M. M.; Hatfield, James G. – Journal of Educational Measurement, 1985

Differences in agreement between observers and analysts of naturalistic narrative data cause problems in observation research. This paper discusses the advantages and disadvantages of several possible solutions. (Author/GDC)

Descriptors: Behavioral Science Research, Data Analysis, Data Collection, Interrater Reliability

Narrative Format Testing Reports and Traditional Testing Reports: A Comparative Study

Peer reviewed

Mathews, Walter M. – Journal of Educational Measurement, 1973

This article reports a comparative study of teacher acceptance of two kinds of testing reports that were generated for Form A of the Iowa Tests of Basic Skills at the fourth-grade level. (Editor)

Descriptors: Academic Achievement, Academic Records, Data Collection, Elementary Schools

Influence of Type of Judge, Normative Information, and Discussion on Standards Recommended for the National Teacher Examinations.

Peer reviewed

Busch, John Christian; Jaeger, Richard M. – Journal of Educational Measurement, 1990

The effects of using recommended data collection procedures on median recommended test standards, variability of recommended test standards, and reliability of recommended standards for 7 subtests of the National Teacher Examinations Communications Skills and General Knowledge Tests were explored, using 236 evaluators (75 public school teachers…

Descriptors: College Faculty, Data Collection, Evaluators, Higher Education

Previous Page | Next Page »

Pages: 1 | 2

Baldwin, Peter	1
Bradlow, Eric T.	1
Branberg, Kenny	1
Braun, Henry I.	1
Bunda, Mary Anne	1
Busch, John Christian	1
Clauser, Brian E.	1
Daria Gerasimova	1
Harris, Deborah J.	1
Hatfield, James G.	1
Jaeger, Richard M.	1
Jones, Eli	1
Levin, Joel R.	1
Marco, Gary L.	1
Mathews, Walter M.	1
Roecks, Alan L.	1
Rogers, W. Todd	1
Scott, M. M.	1
Shu, Zhan	1
Subkoviak, Michael	1
Subkoviak, Michael J.	1
Wiberg, Marie	1
Wind, Stefanie A.	1
Zhu, Mengxiao	1
More ▼