Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 3 |
Since 2006 (last 20 years) | 10 |
Descriptor
Classification | 15 |
Statistical Analysis | 15 |
Test Construction | 15 |
Test Items | 7 |
Models | 4 |
Correlation | 3 |
Difficulty Level | 3 |
Foreign Countries | 3 |
Prediction | 3 |
Research Methodology | 3 |
Scores | 3 |
More ▼ |
Source
ETS Research Report Series | 3 |
International Journal of… | 2 |
Crime & Delinquency | 1 |
Educational and Psychological… | 1 |
Eurasian Journal of… | 1 |
Learning Disability Quarterly | 1 |
Review of Higher Education | 1 |
Author
Sheehan, Kathleen M. | 3 |
Futagi, Yoko | 2 |
Kostin, Irene | 2 |
Awwad, Abeer | 1 |
Becker, Valerie | 1 |
Breyer, F. Jay | 1 |
Bustamante, Rebecca M. | 1 |
COX, RICHARD C. | 1 |
Demir, Ergul | 1 |
Fuller, Matthew B. | 1 |
Gierl, Mark J. | 1 |
More ▼ |
Publication Type
Journal Articles | 10 |
Reports - Research | 7 |
Reports - Descriptive | 3 |
Reports - Evaluative | 2 |
Numerical/Quantitative Data | 1 |
Education Level
Higher Education | 7 |
Postsecondary Education | 6 |
Elementary Secondary Education | 2 |
Audience
Laws, Policies, & Programs
Assessments and Surveys
Graduate Record Examinations | 2 |
Kit of Reference Tests for… | 1 |
Minnesota Multiphasic… | 1 |
Test of English as a Foreign… | 1 |
Wechsler Adult Intelligence… | 1 |
What Works Clearinghouse Rating
Liu, Ren – Educational and Psychological Measurement, 2018
Attribute structure is an explicit way of presenting the relationship between attributes in diagnostic measurement. The specification of attribute structures directly affects the classification accuracy resulted from psychometric modeling. This study provides a conceptual framework for understanding misspecifications of attribute structures. Under…
Descriptors: Diagnostic Tests, Classification, Test Construction, Relationship
Demir, Ergul – Eurasian Journal of Educational Research, 2018
Purpose: The answer-copying tendency has the potential to detect suspicious answer patterns for prior distributions of statistical detection techniques. The aim of this study is to develop a valid and reliable measurement tool as a scale in order to observe the tendency of university students' copying of answers. Also, it is aimed to provide…
Descriptors: College Students, Cheating, Test Construction, Student Behavior
Fuller, Matthew B.; Skidmore, Susan T.; Bustamante, Rebecca M.; Holzweiss, Peggy C. – Review of Higher Education, 2016
Although touted as beneficial to student learning, cultures of assessment have not been examined adequately using validated instruments. Using data collected from a stratified, random sample (N = 370) of U.S. institutional research and assessment directors, the models tested in this study provide empirical support for the value of using the…
Descriptors: Higher Education, Administrators, Evaluation Methods, Attitude Measures
Papageorgiou, Spiros; Morgan, Rick; Becker, Valerie – International Journal of Testing, 2015
The purpose of this study was to enhance the meaning of the scores of an English-language test by developing performance levels and descriptors for reporting overall test performance. The levels and descriptors were intended to accompany the total scale scores of TOEFL Junior® Standard, an international test of English as a second/foreign…
Descriptors: Language Proficiency, Language Tests, English (Second Language), Second Language Learning
Lufi, Dubi; Awwad, Abeer – Learning Disability Quarterly, 2013
The purpose of this article was to describe an initial step developing a new scale to identify individuals with learning disabilities (LD) and test anxiety. Eighty-eight students answered the "Minnesota Multiphasic Personality Inventory-2" (MMPI-2). The participants were drawn from the following three groups: (a) adults with LD and test…
Descriptors: Learning Disabilities, Test Anxiety, Comparative Analysis, Test Validity
Sheehan, Kathleen M. – ETS Research Report Series, 2015
The "TextEvaluator"® text analysis tool is a fully automated text complexity evaluation tool designed to help teachers, curriculum specialists, textbook publishers, and test developers select texts that are consistent with the text complexity guidelines specified in the Common Core State Standards.This paper documents the procedure used…
Descriptors: Scores, Common Core State Standards, Computer Software, Computational Linguistics
Gierl, Mark J.; Lai, Hollis – International Journal of Testing, 2012
Automatic item generation represents a relatively new but rapidly evolving research area where cognitive and psychometric theories are used to produce tests that include items generated using computer technology. Automatic item generation requires two steps. First, test development specialists create item models, which are comparable to templates…
Descriptors: Foreign Countries, Psychometrics, Test Construction, Test Items
Breyer, F. Jay; Lewis, Charles – 1994
A single-administration classification reliability index is described that estimates the probability of consistently classifying examinees to mastery or nonmastery states as if those examinees had been tested with two alternate forms. The procedure is applicable to any test used for classification purposes, subdividing that test into two…
Descriptors: Classification, Cutting Scores, Objective Tests, Pass Fail Grading
Sheehan, Kathleen M.; Kostin, Irene; Futagi, Yoko – ETS Research Report Series, 2007
This paper explores alternative approaches for facilitating efficient, evidence-centered item development for a new type of verbal reasoning item developed for use on the GRE® General Test. Results obtained in two separate studies are reported. The first study documented the development and validation of a fully automated approach for locating the…
Descriptors: College Entrance Examinations, Graduate Study, Test Items, Item Analysis
COX, RICHARD C. – 1965
THE VALIDITY OF AN EDUCATIONAL ACHIEVEMENT TEST DEPENDS UPON THE CORRESPONDENCE BETWEEN SPECIFIED EDUCATIONAL OBJECTIVES AND THE EXTENT TO WHICH THESE OBJECTIVES ARE MEASURED BY THE EVALUATION INSTRUMENT. THIS STUDY IS DESIGNED TO EVALUATE THE EFFECT OF STATISTICAL ITEM SELECTION ON THE STRUCTURE OF THE FINAL EVALUATION INSTRUMENT AS COMPARED WITH…
Descriptors: Achievement Tests, Classification, Educational Objectives, Item Analysis
Olson, John F.; And Others – 1989
Traditionally, item difficulty has been defined in terms of the performance of examinees. For test development purposes, a more useful concept would be some kind of intrinsic item difficulty, defined in terms of the item's content, context, or characteristics and the task demands set by the item. In this investigation, the measurement literature…
Descriptors: Classification, Cluster Analysis, Difficulty Level, Educational Research
Steinke, Elisabeth – 1970
An approach to using the computer to assemble German tests is described. The purposes of the system would be: (1) an expansion of the bilingual lexical memory bank to list and store idioms of all degrees of difficulty, with frequency data and with complete and sophisticated retrieval possibility for assembly; (2) the creation of an…
Descriptors: Classification, Computational Linguistics, Computer Oriented Programs, German
Gottfredson, Stephen D.; Moriarty, Laura J. – Crime & Delinquency, 2006
Statistically based risk assessment devices are widely used in criminal justice settings. Their promise remains largely unfulfilled, however, because assumptions and premises requisite to their development and application are routinely ignored and/or violated. This article provides a brief review of the most salient of these assumptions and…
Descriptors: Risk, Justice, Criminals, Crime
Sheehan, Kathleen M.; Kostin, Irene; Futagi, Yoko; Hemat, Ramin; Zuckerman, Daniel – ETS Research Report Series, 2006
This paper describes the development, implementation, and evaluation of an automated system for predicting the acceptability status of candidate reading-comprehension stimuli extracted from a database of journal and magazine articles. The system uses a combination of classification and regression techniques to predict the probability that a given…
Descriptors: Automation, Prediction, Reading Comprehension, Classification
Madaus, George F.; And Others – 1971
Bloom's taxonomy of the cognitive domain consists of six major levels: Knowledge, Comprehension, Application, Analysis, Synthesis, and Evaluation. The purpose of this study is to construct a quantitative causal model for a set of tests designed to operationally define these six levels in order to further explore the validity of the cumulative…
Descriptors: Achievement Tests, Classification, Cognitive Objectives, Cognitive Processes