ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	6

Descriptor

Models	11
Task Analysis	11
Test Construction	11
Test Items	6
Scores	5
Item Analysis	4
Test Validity	4
Evaluation Methods	3
Criterion Referenced Tests	2
Decision Making	2
Elementary School Students	2
Foreign Countries	2
Guidelines	2
Interrater Reliability	2
Item Response Theory	2
Language Tests	2
Second Language Learning	2
Statistical Analysis	2
Tables (Data)	2
Test Format	2
Test Reliability	2
Test Theory	2
Test Use	2
Testing Problems	2
Vocabulary	2
More ▼

Source

AERA Online Paper Repository	1
Computers & Education	1
Educational Assessment	1
Instructional Science	1
International Educational…	1
Journal of Career Assessment	1
Online Submission	1
ProQuest LLC	1

Publication Type

Reports - Research	5
Journal Articles	4
Speeches/Meeting Papers	3
Dissertations/Theses -…	2
Reports - Descriptive	2
Reports - Evaluative	1

Education Level

Elementary Education	2
Grade 5	1
Grade 6	1
Grade 7	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1

Audience

Administrators	1
Practitioners	1

Location

Georgia	1
Hong Kong	1
Idaho	1
Massachusetts	1
Missouri	1
Oregon	1
Washington	1

Laws, Policies, & Programs

Assessments and Surveys

California Achievement Tests	1
Test of English as a Foreign…	1

What Works Clearinghouse Rating

Showing all 11 results Save | Export

The AI Teacher Test: Measuring the Pedagogical Ability of Blender and GPT-3 in Educational Dialogues

Peer reviewed
PDF on ERIC

Download full text

Tack, Anaïs; Piech, Chris – International Educational Data Mining Society, 2022

How can we test whether state-of-the-art generative models, such as Blender and GPT-3, are good AI teachers, capable of replying to a student in an educational dialogue? Designing an AI teacher test is challenging: although evaluation methods are much-needed, there is no off-the-shelf solution to measuring pedagogical ability. This paper reports…

Descriptors: Artificial Intelligence, Dialogs (Language), Bayesian Statistics, Decision Making

Age, Task Characteristics, and Acoustic Indicators of Engagement: Investigations into the Validity of a Technology-Enhanced Speaking Test for Young Language Learners

Download full text

Edward Paul Getman – Online Submission, 2020

Despite calls for engaging assessments targeting young language learners (YLLs) between 8 and 13 years old, what makes assessment tasks engaging and how such task characteristics affect measurement quality have not been well studied empirically. Furthermore, there has been a dearth of validity research about technology-enhanced speaking tests for…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Learner Engagement

Process Modeling: A Structured Approach to Assessing Complex Decision Making

Peer reviewed

Direct link

Cook, Robert J.; Durning, Steven J. – AERA Online Paper Repository, 2016

In an effort to better align item development to goals of assessing higher-order tasks and decision making, complex decision trees were developed to follow clinical reasoning scripts and used as models on which multiple-choice questions could be built. This approach is compatible with best-practice assessment frameworks like Evidence Centered…

Descriptors: Multiple Choice Tests, Decision Making, Models, Task Analysis

Construct Definition Using Cognitively Based Evidence: A Framework for Practice

Peer reviewed

Direct link

Ketterlin-Geller, Leanne R.; Yovanoff, Paul; Jung, EunJu; Liu, Kimy; Geller, Josh – Educational Assessment, 2013

In this article, we highlight the need for a precisely defined construct in score-based validation and discuss the contribution of cognitive theories to accurately and comprehensively defining the construct. We propose a framework for integrating cognitively based theoretical and empirical evidence to specify and evaluate the construct. We apply…

Descriptors: Test Validity, Construct Validity, Scores, Evidence

Justifying the Use of a Second Language Oral Test as an Exit Test in Hong Kong: An Application of Assessment Use Argument Framework

Direct link

Jia, Yujie – ProQuest LLC, 2013

This study employed Bachman and Palmer's (2010) Assessment Use Argument framework to investigate to what extent the use of a second language oral test as an exit test in a Hong Kong university can be justified. It also aimed to help test developers of this oral test identify the most critical areas in the current test design that might need…

Descriptors: Test Use, Language Tests, Oral Language, Second Language Learning

A Demands-Resources Model of Work Pressure in IT Student Task Groups

Peer reviewed

Direct link

Wilson, E. Vance; Sheetz, Steven D. – Computers & Education, 2010

This paper presents an initial test of the group task demands-resources (GTD-R) model of group task performance among IT students. We theorize that demands and resources in group work influence formation of perceived group work pressure (GWP) and that heightened levels of GWP inhibit group task performance. A prior study identified 11 factors…

Descriptors: Burnout, Group Dynamics, Models, Group Activities

A Modular Approach to Proficiency Testing.

Download full text

Stephenson, Robert W.; And Others – 1973

A new, more specific language for describing work activities, based upon the duty module (clusters of tasks that tend to go together occupationally and organizationally in meaningful ways) is being designed for the Army. The purpose is to improve communications between resource and requirement planners and program operators. The paper proposes two…

Descriptors: Evaluation Methods, Individual Testing, Military Personnel, Models

A Function-Centered Model of Interest Assessment for Business Careers

Peer reviewed

Direct link

Butler, Timothy; Waldroop, James – Journal of Career Assessment, 2004

The authors argue that an effective way to describe the manifestation of interest patterns within a particular work domain is through a nuanced description of interests in terms of the essential functional activities common to that domain. Focusing on the domain of business work and studying a large sample of business professionals over a 15-year…

Descriptors: Psychometrics, Interest Inventories, Business, Vocational Interests

Using Cognitive Science to Assign Test Weights.

Peer reviewed

Bhaskar, R.; Dillard, Jesse F. – Instructional Science, 1983

Description of an objective method for assigning weights to questions on examinations includes discussions of classical test theory, knowledge organization, and how task analysis can be used to identify knowledge elements required to solve specific problems, rank them, and assign objective weights to exam questions using a Pareto distribution (7…

Descriptors: Accounting, Epistemology, Evaluation Methods, Item Analysis

Draft of a Model for Vocational Student Assessment.

Southern Association of Colleges and Schools, Atlanta, GA. – 1983

This volume contains the initial draft of a model for assessing students in vocational education programs in Georgia. Addressed in the first section of the draft are some of the components that are believed to be critical in the development of a model for assessing vocational student achievement, including selecting a program for use in developing…

Descriptors: Academic Achievement, Behavioral Objectives, Criterion Referenced Tests, Guidelines

Proficiency-Referencing a Reading Achievement Test.

Bormuth, John R. – 1979

A procedure is demonstrated for constructing tables showing, for each score on a commercial reading achievement test, the percentage of real-world materials that the testee is likely to comprehend with at least a criterion level of proficiency, the percentages of students in a local or national sample who can competently comprehend a given…

Descriptors: Criterion Referenced Tests, Elementary Secondary Education, Equivalency Tests, Expectancy Tables

Bhaskar, R.	1
Bormuth, John R.	1
Butler, Timothy	1
Cook, Robert J.	1
Dillard, Jesse F.	1
Durning, Steven J.	1
Edward Paul Getman	1
Geller, Josh	1
Jia, Yujie	1
Jung, EunJu	1
Ketterlin-Geller, Leanne R.	1
Liu, Kimy	1
Piech, Chris	1
Sheetz, Steven D.	1
Stephenson, Robert W.	1
Tack, Anaïs	1
Waldroop, James	1
Wilson, E. Vance	1
Yovanoff, Paul	1
More ▼