Kate E. Walton; Cristina Anguiano-Carrasco – ACT, Inc., 2024
Large language models (LLMs), such as ChatGPT, are becoming increasingly prominent. They are increasingly used to assist with simple tasks such as summarizing documents, translating languages, rephrasing sentences, and answering questions. Reports like McKinsey's (Chui & Yee, 2023) estimate that by implementing LLMs,…
Descriptors: Artificial Intelligence, Man Machine Systems, Natural Language Processing, Test Construction
OECD Publishing, 2023
Advances in artificial intelligence (AI) are ushering in a large and rapid technological transformation. Understanding how AI capabilities relate to human skills and how they develop over time is crucial for understanding this process. In 2016, the OECD assessed AI capabilities with its Survey of Adult Skills (PIAAC). The present report…
Descriptors: Artificial Intelligence, Adults, Reading Tests, Mathematics Tests
Ercikan, Kadriye; Guo, Hongwen; He, Qiwei – Educational Assessment, 2020
Comparing groups is one of the key uses of large-scale assessment results, which are used to gain insights that inform policy and practice and to examine the comparability of scores and score meaning. Such comparisons typically focus on examinees' final answers and responses to test questions, ignoring response process differences groups may engage…
Descriptors: Data Use, Responses, Comparative Analysis, Test Bias
Falabella, Alejandra – British Journal of Sociology of Education, 2016
Using qualitative data from two Chilean public schools, I interrogate the expectation that standardised testing motivates staff to critically self-assess and to be accountable for failing evaluations. The research findings offer new insights into the ways in which school members, especially head managers, strategically debate,…
Descriptors: Tests, Scores, Accountability, Criticism
Jones, Jason P.; McConnell, David A. – Journal of Geoscience Education, 2023
In the past couple of decades, the geoscience education community has made great strides toward investigating how to provide effective student learning experiences in the college setting. While experiences such as student-centered teaching strategies and course design elements are useful for the instructor, they may not make important elements of…
Descriptors: Geology, Introductory Courses, Science Instruction, Teaching Methods
Ramineni, Chaitanya; Trapani, Catherine S.; Williamson, David M. – ETS Research Report Series, 2015
Automated scoring models were trained and evaluated for the essay task in the "Praxis I"® writing test. Prompt-specific and generic "e-rater"® scoring models were built, and evaluation statistics, such as quadratic weighted kappa, Pearson correlation, and standardized differences in mean scores, were examined to evaluate the…
Descriptors: Writing Tests, Licensing Examinations (Professions), Teacher Competency Testing, Scoring
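Among the evaluation statistics named in the abstract, quadratic weighted kappa is the standard agreement measure between automated and human scores. As a minimal illustration (a generic sketch, not ETS's e-rater evaluation code):

```python
import numpy as np

def quadratic_weighted_kappa(scores_a, scores_b, num_levels):
    """Quadratic weighted kappa between two integer score vectors in 0..num_levels-1.
    1.0 = perfect agreement; 0.0 = chance-level agreement."""
    a, b = np.asarray(scores_a), np.asarray(scores_b)
    # Joint (observed) distribution of score pairs
    observed = np.zeros((num_levels, num_levels))
    for i, j in zip(a, b):
        observed[i, j] += 1
    observed /= observed.sum()
    # Expected distribution under independence of the two raters
    expected = np.outer(observed.sum(axis=1), observed.sum(axis=0))
    # Quadratic disagreement weights: zero on the diagonal, growing with distance
    idx = np.arange(num_levels)
    weights = (idx[:, None] - idx[None, :]) ** 2 / (num_levels - 1) ** 2
    return 1.0 - (weights * observed).sum() / (weights * expected).sum()
```

Identical score vectors yield 1.0; systematic disagreement drives the value toward -1.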
Baird, Jo-Anne; Meadows, Michelle; Leckie, George; Caro, Daniel – Assessment in Education: Principles, Policy & Practice, 2017
This study evaluated rater accuracy with rater-monitoring data from high stakes examinations in England. Rater accuracy was estimated with cross-classified multilevel modelling. The data included face-to-face training and monitoring of 567 raters in 110 teams, across 22 examinations, giving a total of 5500 data points. Two rater-monitoring systems…
Descriptors: Foreign Countries, High Stakes Tests, Accuracy, Hierarchical Linear Modeling
Nguyen, David J. – Tertiary Education and Management, 2016
International student assessments have become the "lifeblood" of the accountability movement in educational policy contexts. Drawing upon Stuart Hall's concept of representation, I critically examined who comprises epistemic communities responsible for developing the Organization for Economic Co-operation and Development's Assessment of…
Descriptors: Student Evaluation, Foreign Students, Epistemology, Expertise
Frankel, Lois; Brownstein, Beth; Soiffer, Neil; Hansen, Eric – ETS Research Report Series, 2016
The work described in this report is the first phase of a project to provide easy-to-use tools for authoring and rendering secondary-school algebra-level math expressions in synthesized speech that is useful for students with blindness or low vision. This report describes the initial development, software implementation, and evaluation of the…
Descriptors: Algebra, Automation, Secondary School Mathematics, Artificial Speech
Kolen, Michael J.; Lee, Won-Chan – Educational Measurement: Issues and Practice, 2011
This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…
Descriptors: Test Use, Test Format, Error of Measurement, Raw Scores
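Conditional standard errors of measurement vary with score level rather than being a single test-wide constant. One classical illustration for number-correct scores is Lord's binomial-error formula (a deliberate simplification; the paper itself addresses the more complex mixed-format case):

```python
import math

def binomial_csem(raw_score, num_items):
    """Lord's binomial-error conditional SEM for a number-correct raw score x
    on an n-item test: CSEM(x) = sqrt(x * (n - x) / (n - 1)).
    Largest near mid-scale, zero at the score extremes."""
    return math.sqrt(raw_score * (num_items - raw_score) / (num_items - 1))
```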
Guskey, Thomas R. – Journal of Staff Development, 2016
Effective professional learning evaluation requires consideration of five critical stages or levels of information. These five levels, which are presented in this article, represent an adaptation of an evaluation model developed by Kirkpatrick (1959, 1998) for judging the value of supervisory training programs in business and industry.…
Descriptors: Hierarchical Linear Modeling, Outcomes of Education, Supervisory Training, Faculty Development
What Works Clearinghouse, 2012
The study reviewed in this report examined the effectiveness of the "Milwaukee Parental Choice Program" ("MPCP"), which provides vouchers for low-income students to attend private schools. The study analyzed data on about 600 students who were given "MPCP" vouchers in the 2006-07 school year. The authors created a…
Descriptors: Private Schools, Evaluation, Reading Tests, Standardized Tests
Sunnassee, Devdass – ProQuest LLC, 2011
Small sample equating remains a largely unexplored area of research. This study attempts to fill in some of the research gaps via a large-scale, IRT-based simulation study that evaluates the performance of seven small-sample equating methods under various test characteristic and sampling conditions. The equating methods considered are typically…
Descriptors: Test Length, Test Format, Sample Size, Simulation
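For context on what such equating methods do: the simplest approaches map scores from one test form onto another by matching distributional moments. A hypothetical sketch of single-group linear equating (the study's IRT-based methods are considerably more elaborate):

```python
import statistics

def linear_equate(x, form_x_scores, form_y_scores):
    """Map a Form X score onto the Form Y scale by matching means and SDs:
    e(x) = mu_y + (sd_y / sd_x) * (x - mu_x)."""
    mu_x = statistics.fmean(form_x_scores)
    mu_y = statistics.fmean(form_y_scores)
    sd_x = statistics.pstdev(form_x_scores)
    sd_y = statistics.pstdev(form_y_scores)
    return mu_y + (sd_y / sd_x) * (x - mu_x)
```

With small samples the moment estimates above become unstable, which is exactly the problem the dissertation's simulation study probes.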
Shin, Sun-Young; Lidster, Ryan – Language Testing, 2017
In language programs, it is crucial to place incoming students into appropriate levels to ensure that course curriculum and materials are well targeted to their learning needs. Deciding how and where to set cutscores on placement tests is thus of central importance to programs, but previous studies in educational measurement disagree as to which…
Descriptors: Language Tests, English (Second Language), Standard Setting (Scoring), Student Placement
Woods, Carol M.; Cai, Li; Wang, Mian – Educational and Psychological Measurement, 2013
Differential item functioning (DIF) occurs when the probability of responding in a particular category to an item differs for members of different groups who are matched on the construct being measured. The identification of DIF is important for valid measurement. This research evaluates an improved version of Lord's X² Wald test for…
Descriptors: Test Bias, Item Response Theory, Computation, Comparative Analysis
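The Wald statistic at the core of this approach compares an item's estimated parameters across two groups. A minimal sketch with hypothetical inputs (the basic form only; the authors' improvement concerns how the parameter covariances are estimated):

```python
import numpy as np

def lords_wald_statistic(params_ref, params_focal, cov_ref, cov_focal):
    """Wald chi-square for one item's parameter vector across two groups:
    diff' * inv(cov_ref + cov_focal) * diff, with df = number of parameters.
    A large value flags the item as a DIF candidate."""
    diff = np.asarray(params_ref, float) - np.asarray(params_focal, float)
    pooled = np.asarray(cov_ref, float) + np.asarray(cov_focal, float)
    # Solve the linear system instead of forming an explicit inverse
    return float(diff @ np.linalg.solve(pooled, diff))
```

The statistic is referred to a chi-square distribution with degrees of freedom equal to the number of item parameters tested.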