ERIC - Search Results

Publication Date

In 2025	1
Since 2024	1
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	2

Source

Journal of Educational…

Author

Fitzpatrick, Anne R.	2
Amery D. Wu	1
Bennett, Randy Elliot	1
Calfee, Robert	1
Ebel, Robert L.	1
Ercikan, Kadriye	1
Haertel, Edward	1
Hardy, Roy A.	1
Hastings, C. Nicholas	1
Ito, Kyoko	1
Jake Stone	1
Linn, Robert L.	1
Mislevy, Robert J.	1
Sebrechts, Marc M.	1
Shun-Fu Hu	1
Sykes, Robert C.	1
Vispoel, Walter P.	1
Wardrop, James L.	1
More ▼

Publication Type

Journal Articles	11
Reports - Research	5
Reports - Evaluative	4
Information Analyses	2
Book/Product Reviews	1
Opinion Papers	1
Speeches/Meeting Papers	1

Education Level

Higher Education	2
Postsecondary Education	2

Audience

Researchers

Location

Laws, Policies, & Programs

Assessments and Surveys

Alabama High School…	1
Law School Admission Test	1

What Works Clearinghouse Rating

Showing all 11 results Save | Export

Using Multilabel Neural Network to Score High-Dimensional Assessments for Different Use Foci: An Example with College Major Preference Assessment

Peer reviewed

Direct link

Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025

Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…

Descriptors: Tests, Testing, Scores, Test Construction

How Developments in Psychology and Technology Challenge Validity Argumentation

Peer reviewed

Direct link

Mislevy, Robert J. – Journal of Educational Measurement, 2016

Validity is the sine qua non of properties of educational assessment. While a theory of validity and a practical framework for validation has emerged over the past decades, most of the discussion has addressed familiar forms of assessment and psychological framings. Advances in digital technologies and in cognitive and social psychology have…

Descriptors: Test Validity, Technology, Cognitive Psychology, Social Psychology

Proposed Solutions to Two Problems of Test Construction.

Peer reviewed

Ebel, Robert L. – Journal of Educational Measurement, 1982

Reasonable and practical solutions to two major problems confronting the developer of any test of educational achievement (what to measure and how to measure it) are proposed, defended, and defined. (Author/PN)

Descriptors: Measurement Techniques, Objective Tests, Test Construction, Test Items

A Framework for Analyzing the Inference Structure of Educational Achievement Tests.

Peer reviewed

Wardrop, James L.; And Others – Journal of Educational Measurement, 1982

A structure for describing different approaches to testing is generated by identifying five dimensions along which tests differ: test uses, item generation, item revision, assessment of precision, and validation. These dimensions are used to profile tests of reading comprehension. Only norm-referenced achievement tests had an inference system…

Descriptors: Achievement Tests, Comparative Analysis, Educational Testing, Models

School Achievement: Thinking About What to Test.

Peer reviewed

Haertel, Edward; Calfee, Robert – Journal of Educational Measurement, 1983

The history of the relation between achievement tests and curriculum programs is reviewed, and it is concluded that content specialists are best qualified as sources of curricular goals to specify content, kinds of attainment, and standards. The specification of instructional intents is also considered from the perspective of modern cognition…

Descriptors: Achievement Tests, Cognitive Processes, Curriculum, Instructional Development

Test Review: Basic Achievement Skills Individual Screener.

Peer reviewed

Fitzpatrick, Anne R. – Journal of Educational Measurement, 1984

This article reviews the Basic Achievement Skills Individual Screener (BASIS), an individually administered achievement battery that consists of skills tests in reading, mathematics, and spelling as well as an optional writing exercise. BASIS is found to be an effective and efficient means of assessing basic skills. (Author/EGS)

Descriptors: Achievement Tests, Basic Skills, Screening Tests, Test Construction

Technical Issues in Large-Scale Performance Assessment [Book Review].

Peer reviewed

Sykes, Robert C.; Ito, Kyoko; Fitzpatrick, Anne R.; Ercikan, Kadriye – Journal of Educational Measurement, 1997

The five chapters of this report provide resources that deal with the validity, generalizability, comparability, performance standards, and fairness, equity, and bias of performance assessments. The book is written for experienced educational measurement practitioners, although an extensive familiarity with performance assessment is not required.…

Descriptors: Educational Assessment, Measurement Techniques, Performance Based Assessment, Standards

Measuring Instructional Validity: A Report of an Instructional Validity Study for the Alabama High School Graduation Examination.

Peer reviewed

Hardy, Roy A. – Journal of Educational Measurement, 1984

To determine to what extent competencies to be measured by the Alabama High School Graduation Examination were being taught in the Alabama public schools, a survey was conducted of teachers of grades 7, 8, 9, and 10. Competencies that were not being taught are identified and possible explanations are outlined. (Author/EGS)

Descriptors: Academic Standards, Competency Based Education, Evaluation Criteria, Graduation Requirements

A Meta Analysis of the Validity of Predictors of Performance in Law School.

Peer reviewed

Linn, Robert L.; Hastings, C. Nicholas – Journal of Educational Measurement, 1984

Using predictive validity studies of the Law School Admissions Test (LSAT) and the undergraduate grade-point average (UGPA), this study examined the large variation in the magnitude of the validity coefficients across schools. LSAT standard deviation and correlation between LSAT and UGPA accounted for 58.5 percent of the variability. (Author/EGS)

Descriptors: Academic Achievement, College Applicants, College Entrance Examinations, Grade Point Average

Computerized Adaptive and Fixed-Item Testing of Music Listening Skill: A Comparison of Efficiency, Precision, and Concurrent Validity.

Peer reviewed

Vispoel, Walter P.; And Others – Journal of Educational Measurement, 1997

Efficiency, precision, and concurrent validity of results from adaptive and fixed-item music listening tests were studied using: (1) 2,200 simulated examinees; (2) 204 live examinees; and (3) 172 live examinees. Results support the usefulness of adaptive tests for measuring skills that require aurally produced items. (SLD)

Descriptors: Adaptive Testing, Adults, College Students, Comparative Analysis

A Computer-Based Task for Measuring the Representational Component of Quantitative Proficiency.

Peer reviewed

Bennett, Randy Elliot; Sebrechts, Marc M. – Journal of Educational Measurement, 1997

A computer-delivered problem-solving task based on cognitive research literature was developed and its validity for graduate admissions assessment was studied with 107 undergraduates. Use of the test, which asked examinees to sort word-problem stems by prototypes, was supported by the findings. (SLD)

Descriptors: Admission (School), College Entrance Examinations, Computer Assisted Testing, Graduate Study

Test Use	11
Test Validity	11
Test Construction	5
Achievement Tests	3
Higher Education	3
Measurement Techniques	3
Test Reliability	3
Testing Problems	3
College Entrance Examinations	2
Comparative Analysis	2
Computer Assisted Testing	2
Educational Assessment	2
Test Items	2
Academic Achievement	1
Academic Standards	1
Accuracy	1
Adaptive Testing	1
Admission (School)	1
Adults	1
Basic Skills	1
Cognitive Processes	1
Cognitive Psychology	1
College Applicants	1
College Students	1
Competency Based Education	1
More ▼