Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 7 |
Since 2006 (last 20 years) | 11 |
Descriptor
Source
Educational Measurement:… | 28 |
Author
Hiscox, Michael D. | 2 |
Stone, Clement A. | 2 |
Alina A. von Davier | 1 |
Almusharraf, Norah | 1 |
Alotaibi, Hind | 1 |
Ames, Allison J. | 1 |
An, Ji | 1 |
Baker, Frank B. | 1 |
Bernstein, Lawrence | 1 |
Bolt, Daniel M. | 1 |
Brzezinski, Evelyn J. | 1 |
More ▼ |
Publication Type
Education Level
Higher Education | 1 |
Postsecondary Education | 1 |
Audience
Researchers | 7 |
Students | 1 |
Location
Saudi Arabia | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Jiangang Hao; Alina A. von Davier; Victoria Yaneva; Susan Lottridge; Matthias von Davier; Deborah J. Harris – Educational Measurement: Issues and Practice, 2024
The remarkable strides in artificial intelligence (AI), exemplified by ChatGPT, have unveiled a wealth of opportunities and challenges in assessment. Applying cutting-edge large language models (LLMs) and generative AI to assessment holds great promise in boosting efficiency, mitigating bias, and facilitating customized evaluations. Conversely,…
Descriptors: Evaluation Methods, Artificial Intelligence, Educational Change, Computer Software
Bunch, Michael B. – Educational Measurement: Issues and Practice, 2020
In this digital ITEMS module, Dr. Michael Bunch provides an in-depth, step-by-step look at how standard setting is done. It does not focus on any specific procedure or methodology (e.g., modified Angoff, bookmark, and body of work) but on the practical tasks that must be completed for any standard setting activity. Dr. Bunch carries the…
Descriptors: Standard Setting, Cutting Scores, Scores, Reports
Levy, Roy – Educational Measurement: Issues and Practice, 2020
In this digital ITEMS module, Dr. Roy Levy describes Bayesian approaches to psychometric modeling. He discusses how Bayesian inference is a mechanism for reasoning in a probability-modeling framework and is well-suited to core problems in educational measurement: reasoning from student performances on an assessment to make inferences about their…
Descriptors: Bayesian Statistics, Psychometrics, Item Response Theory, Statistical Inference
Hancock, Gregory R.; An, Ji – Educational Measurement: Issues and Practice, 2018
In this ITEMS module, we frame the topic of scale reliability within a "confirmatory factor analysis" and "structural equation modeling" (SEM) context and address some of the limitations of Cronbach's a. This modeling approach has two major advantages: (1) it allows researchers to make explicit the relation between their items…
Descriptors: Reliability, Structural Equation Models, Factor Analysis, Correlation
Gregg, Nikole; Leventhal, Brian C. – Educational Measurement: Issues and Practice, 2020
In this digital ITEMS module, Nikole Gregg and Dr. Brian Leventhal discuss strategies to ensure data visualizations achieve graphical excellence. Data visualizations are commonly used by measurement professionals to communicate results to examinees, the public, educators, and other stakeholders. To do so effectively, it is important that these…
Descriptors: Data Analysis, Evidence Based Practice, Visualization, Test Results
Wang, Jue; Engelhard, George, Jr. – Educational Measurement: Issues and Practice, 2019
In this digital ITEMS module, Dr. Jue Wang and Dr. George Engelhard Jr. describe the Rasch measurement framework for the construction and evaluation of new measures and scales. From a theoretical perspective, they discuss the historical and philosophical perspectives on measurement with a focus on Rasch's concept of specific objectivity and…
Descriptors: Item Response Theory, Evaluation Methods, Measurement, Goodness of Fit
Almusharraf, Norah; Alotaibi, Hind – Educational Measurement: Issues and Practice, 2021
Committing errors is expected in the development of language acquisition and learning; however, there is limited research that contributes to the literature on the effect of gender of English as a foreign language (EFL) writing. This study explored the gender differences in EFL students' writing using two approaches: human evaluation and…
Descriptors: Gender Differences, Second Language Learning, Second Language Instruction, English (Second Language)
Ames, Allison J.; Penfield, Randall D. – Educational Measurement: Issues and Practice, 2015
Drawing valid inferences from item response theory (IRT) models is contingent upon a good fit of the data to the model. Violations of model-data fit have numerous consequences, limiting the usefulness and applicability of the model. This instructional module provides an overview of methods used for evaluating the fit of IRT models. Upon completing…
Descriptors: Item Response Theory, Goodness of Fit, Models, Evaluation Methods
Templin, Jonathan; Hoffman, Lesa – Educational Measurement: Issues and Practice, 2013
Diagnostic classification models (aka cognitive or skills diagnosis models) have shown great promise for evaluating mastery on a multidimensional profile of skills as assessed through examinee responses, but continued development and application of these models has been hindered by a lack of readily available software. In this article we…
Descriptors: Classification, Models, Language Tests, English (Second Language)
Lei, Pui-Wa; Wu, Qiong – Educational Measurement: Issues and Practice, 2007
Structural equation modeling (SEM) is a versatile statistical modeling tool. Its estimation techniques, modeling capacities, and breadth of applications are expanding rapidly. This module introduces some common terminologies. General steps of SEM are discussed along with important considerations in each step. Simple examples are provided to…
Descriptors: Structural Equation Models, Guidelines, Definitions, Computer Software

Stone, Clement A. – Educational Measurement: Issues and Practice, 1989
MicroCAT version 3.0--an integrated test development, administration, and analysis system--is reviewed in this first article of a series on testing software. A framework for comparing testing software is presented. The strength of this package lies in the development, banking, and administration of items composed of text and graphics. (SLD)
Descriptors: Computer Assisted Testing, Computer Software, Computer Software Reviews, Data Analysis

Hsu, Tse-chi; Yu, Lifa – Educational Measurement: Issues and Practice, 1989
How computers are used to analyze item data is reviewed, and the information that existing item-analysis programs provide is described. Summaries of studies comparing the performance of some of these packages reveal some of their current limitations. Emphasis is on the usefulness to educational practice of these packages. (SLD)
Descriptors: Computer Assisted Testing, Computer Software, Computer Software Reviews, Computer Uses in Education
Kim, Jee-Seon; Bolt, Daniel M. – Educational Measurement: Issues and Practice, 2007
The purpose of this ITEMS module is to provide an introduction to Markov chain Monte Carlo (MCMC) estimation for item response models. A brief description of Bayesian inference is followed by an overview of the various facets of MCMC algorithms, including discussion of prior specification, sampling procedures, and methods for evaluating chain…
Descriptors: Placement, Monte Carlo Methods, Markov Processes, Measurement

Bernstein, Lawrence; Liu, Mei – Educational Measurement: Issues and Practice, 1989
FINDIT Version 2.20--a database-retrieval software application to assist researchers/users in locating references in the "Journal of Educational Measurement" and "Educational Measurement: Issues and Practice" from 1975 to 1988--is reviewed. The system includes a cumulative index of citations and search/retrieval software.…
Descriptors: Computer Software, Databases, Indexes, Information Retrieval
Skaggs, Gary – Educational Measurement: Issues and Practice, 2004
Research on psychometric methods is heavily dependent on software. The quality, availability, and documentation of such software are critical to the advancement of the field. In 2000, an ad hoc committee of NCME recommended that NCME adopt policies that promote greater availability and better documentation of software. This article follows the ad…
Descriptors: Psychometrics, Computer Software, Computer Assisted Testing, Evaluation Methods
Previous Page | Next Page ยป
Pages: 1 | 2