ERIC - Search Results

Publication Date

In 2025	9
Since 2024	16
Since 2021 (last 5 years)	40
Since 2016 (last 10 years)	62
Since 2006 (last 20 years)	151

Descriptor

Evaluation Methods	403
Test Validity	403
Test Reliability	182
Student Evaluation	124
Test Construction	118
Testing	106
Testing Problems	98
Computer Assisted Testing	85
Elementary Secondary Education	63
Foreign Countries	62
Educational Testing	49
Measurement Techniques	49
Educational Assessment	47
Standardized Tests	45
Testing Programs	45
Higher Education	41
Test Bias	40
Evaluation Criteria	39
Scores	36
Test Interpretation	35
Test Use	35
Psychometrics	34
Academic Achievement	32
Disabilities	28
Evaluation Research	28

Education Level

Elementary Secondary Education	41
Higher Education	39
Postsecondary Education	27
Elementary Education	17
Secondary Education	14
Early Childhood Education	10
Middle Schools	6
Preschool Education	6
Grade 5	5
High Schools	5
Adult Education	4
Junior High Schools	4
Primary Education	4
Grade 6	3
Grade 7	3
Grade 8	3
Intermediate Grades	3
Two Year Colleges	3
Adult Basic Education	2
Grade 4	2
Grade 9	2
Kindergarten	2
Grade 1	1
Grade 10	1
Grade 2	1

Audience

Practitioners	18
Researchers	16
Teachers	10
Administrators	5
Support Staff	3
Policymakers	2
Community	1
Counselors	1
Parents	1
Students	1

Location

California	7
United Kingdom	7
Canada	6
Germany	5
United Kingdom (England)	5
Australia	3
Indonesia	3
Israel	3
Arizona	2
India	2
Japan	2
Kenya	2
Michigan	2
Nebraska	2
Nigeria	2
Oman	2
Saudi Arabia	2
Singapore	2
Sweden	2
Tennessee	2
Texas	2
Turkey	2
United Arab Emirates	2
United Kingdom (Wales)	2
Utah	2

Laws, Policies, & Programs

No Child Left Behind Act 2001	8
Elementary and Secondary…	6
Every Student Succeeds Act…	5
Individuals with Disabilities…	5
Rehabilitation Act 1973…	3
Elementary and Secondary…	2
Americans with Disabilities…	1
Comprehensive Education…	1
Education Consolidation…	1
Elementary and Secondary…	1

Showing 1 to 15 of 403 results

Using Automated Procedures to Score Educational Essays Written in Three Languages

Peer reviewed

Tahereh Firoozi; Hamid Mohammadi; Mark J. Gierl – Journal of Educational Measurement, 2025

The purpose of this study is to describe and evaluate a multilingual automated essay scoring (AES) system for grading essays in three languages. Two different sentence embedding models were evaluated within the AES system, multilingual BERT (mBERT) and language-agnostic BERT sentence embedding (LaBSE). German, Italian, and Czech essays were…

Descriptors: College Students, Slavic Languages, German, Italian

Response Process Evidence for Academic Assessments of Students with Significant Cognitive Disabilities

Peer reviewed
PDF on ERIC

Meagan Karvonen; Russell Swinburne Romine; Amy K. Clark – Practical Assessment, Research & Evaluation, 2024

This paper describes methods and findings from student cognitive labs, teacher cognitive labs, and test administration observations as evidence evaluated in a validity argument for a computer-based alternate assessment for students with significant cognitive disabilities. Validity of score interpretations and uses for alternate assessments based…

Descriptors: Students with Disabilities, Intellectual Disability, Severe Disabilities, Student Evaluation

Signal-to-Noise Ratio in Estimating and Testing the Mediation Effect: Structural Equation Modeling versus Path Analysis with Weighted Composites

Peer reviewed

Ke-Hai Yuan; Zhiyong Zhang; Lijuan Wang – Grantee Submission, 2024

Mediation analysis plays an important role in understanding causal processes in social and behavioral sciences. While path analysis with composite scores was criticized to yield biased parameter estimates when variables contain measurement errors, recent literature has pointed out that the population values of parameters of latent-variable models…

Descriptors: Structural Equation Models, Path Analysis, Weighted Scores, Comparative Testing

Applying a Mixture Rasch Model-Based Approach to Standard Setting

Peer reviewed

Peabody, Michael R.; Muckle, Timothy J.; Meng, Yu – Educational Measurement: Issues and Practice, 2023

The subjective aspect of standard-setting is often criticized, yet data-driven standard-setting methods are rarely applied. Therefore, we applied a mixture Rasch model approach to setting performance standards across several testing programs of various sizes and compared the results to existing passing standards derived from traditional…

Descriptors: Item Response Theory, Standard Setting, Testing, Sampling

Adapting Paper-Based Tests for Computer Administration: Lessons Learned from 30 Years of Mode Effects Studies in Education

Peer reviewed
PDF on ERIC

Lynch, Sarah – Practical Assessment, Research & Evaluation, 2022

In today's digital age, tests are increasingly being delivered on computers. Many of these computer-based tests (CBTs) have been adapted from paper-based tests (PBTs). However, this change in mode of test administration has the potential to introduce construct-irrelevant variance, affecting the validity of score interpretations. Because of this,…

Descriptors: Computer Assisted Testing, Tests, Scores, Scoring

Data Acquiring System for Gas Turbine Engine's Dynamic Performance; Build and Validate

Peer reviewed

Mostafa M. Samy; Mohamed A. Metwally; Mahmoud Ashry; Wael M. Elmayyah – Measurement: Interdisciplinary Research and Perspectives, 2025

Gas Turbine Engines (GTE) have the highest power-to-weight ratio among Internal Combustion Engines (ICE). Its modularity and ability to utilize various types of fuel make it highly recommended in power plants, naval transportation, and, of course, the most equipped in aviation. The lack of GTEs' real data is increasing a recognized need for…

Descriptors: Engines, Power Technology, Data Collection, Data Interpretation

Assessing Vocabulary Knowledge in Written and Signed Languages of Immigrant DHH Learners -- Examining Convergent Validity

Peer reviewed

Nicole Marx; Wolfgang Mann – Journal of Multilingual and Multicultural Development, 2025

Language assessment is a central aspect not only of language education in the general population, but also amongst heterogeneous, low-incidence populations. One such population are immigrant deaf and hard-of-hearing learners (IDML) who are bimodal-multilingual and whose languages development often includes the spoken, written, and/or signed…

Descriptors: Foreign Countries, German, Sign Language, Immigrants

Interpreting Testing and Assessment: A State-of-the-Art Review

Peer reviewed

Han, Chao – Language Testing, 2022

Over the past decade, testing and assessing spoken-language interpreting has garnered an increasing amount of attention from stakeholders in interpreter education, professional certification, and interpreting research. This is because in these fields assessment results provide a critical evidential basis for high-stakes decisions, such as the…

Descriptors: Translation, Language Tests, Testing, Evaluation Methods

Instruction-Tuned Large-Language Models for Quality Control in Automatic Item Generation: A Feasibility Study

Peer reviewed

Guher Gorgun; Okan Bulut – Educational Measurement: Issues and Practice, 2025

Automatic item generation may supply many items instantly and efficiently to assessment and learning environments. Yet, the evaluation of item quality persists to be a bottleneck for deploying generated items in learning and assessment settings. In this study, we investigated the utility of using large-language models, specifically Llama 3-8B, for…

Descriptors: Artificial Intelligence, Quality Control, Technology Uses in Education, Automation

Poverty and Wealth without a Ladder? An Appraisal of the Stages of Progress Method among Agro-Pastoralists in Ethiopia's Lower Omo Valley

Peer reviewed

Edward G. J. Stevenson; Jil Molenaar; David-Paul Pertaub; Dessalegn Tekle – Field Methods, 2025

Is it possible to measure wealth and poverty across settings while being faithful to local understandings? The stages of progress method (SoP) attempts to do this by building ladders of wealth in locally relevant terms and using these in comparisons across groups. This approach is potentially useful among pastoralist populations where monetary…

Descriptors: Foreign Countries, Poverty, Social Mobility, Evaluation Methods

Digital Games for Creativity Assessment: Strengths, Weaknesses and Opportunities

Peer reviewed

Rafner, Janet; Biskjaer, Michael Mose; Zana, Blanka; Langsford, Steven; Bergenholtz, Carsten; Rahimi, Seyedahmad; Carugati, Andrea; Noy, Lior; Sherson, Jacob – Creativity Research Journal, 2022

Creativity assessments should be valid, reliable, and scalable to support various stakeholders (e.g., policy-makers, educators, corporations, and the general public) in their decision-making processes. Established initiatives toward scalable creativity assessments have relied on well-studied standardized tests. Although robust in many ways, most…

Descriptors: Creativity, Evaluation Methods, Video Games, Computer Assisted Testing

Evidence-Based Evaluation of Student and Marker Performances in Assessment and Examination

Peer reviewed

Ole J. Kemi – Advances in Physiology Education, 2025

Students are assessed by coursework and/or exams, all of which are marked by assessors (markers). Student and marker performances are then subject to end-of-session board of examiner handling and analysis. This occurs annually and is the basis for evaluating students but also the wider learning and teaching efficiency of an academic institution.…

Descriptors: Undergraduate Students, Evaluation Methods, Evaluation Criteria, Academic Standards

Online Classroom-Based Reading Assessment: Comprehension and Practice Development

Peer reviewed
PDF on ERIC

Eko Suhartoyo; Rida Afrilyasanti; Nur Mukminatien – Turkish Online Journal of Distance Education, 2025

In this paper, we investigated the impact of an online classroom-based reading assessment on implementing practices in reading instruction among 30 EFL learners in an intermediate reading course at a public university in East Java, Indonesia. Our study aimed to develop an online classroom-based reading assessment and evaluate its efficacy in…

Descriptors: Student Evaluation, Computer Assisted Testing, Reading Tests, Reading Instruction

Using Multilabel Neural Network to Score High-Dimensional Assessments for Different Use Foci: An Example with College Major Preference Assessment

Peer reviewed

Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025

Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…

Descriptors: Tests, Testing, Scores, Test Construction

Exploring the Effectiveness of Large-Scale Automated Writing Evaluation Implementation on State Test Performance Using Generalised Boosted Modelling

Peer reviewed

Yue Huang; Joshua Wilson – Journal of Computer Assisted Learning, 2025

Background: Automated writing evaluation (AWE) systems, used as formative assessment tools in writing classrooms, are promising for enhancing instruction and improving student performance. Although meta-analytic evidence supports AWE's effectiveness in various contexts, research on its effectiveness in the U.S. K-12 setting has lagged behind its…

Descriptors: Writing Evaluation, Writing Skills, Writing Tests, Writing Instruction

Source

Measurement:…	12
Educational Measurement:…	8
ETS Research Report Series	4
Exceptional Children	4
Grantee Submission	4
Language, Speech, and Hearing…	4
Online Submission	4
Phi Delta Kappan	4
ProQuest LLC	4
Assessment & Evaluation in…	3
Educational Technology	3
Journal of Educational…	3
Language Testing	3
Measurement and Evaluation in…	3
Practical Assessment,…	3
Smarter Balanced Assessment…	3
Advances in Physiology…	2
Alberta Journal of…	2
Assessing Writing	2
Assessment and Evaluation in…	2
Assessment for Effective…	2
Career Development for…	2
Center on Standards and…	2
College Student Journal	2
Community College Research…	2

Author

Haertel, Geneva	3
Thurlow, Martha L.	3
Bagnato, Stephen J.	2
Baker, Eva L.	2
Bielinski, John	2
Boyle, Michael H.	2
Bracey, Gerald W.	2
Burling, Kelly	2
Cameto, Renee	2
Clarke-Midura, Jody	2
Cook, Linda	2
Cunningham, Charles E.	2
Fuchs, Lynn S.	2
Gearhart, Maryl	2
Hoepfner, Ralph	2
Hughes, Katherine L.	2
Jaeger, Richard M.	2
Klein, Stephen P.	2
Kratochwill, Thomas R.	2
Leitzel, Thomas C.	2
Linn, Robert L.	2
Macy, Marisa	2
Minnema, Jane	2
Murray, Elizabeth	2

Publication Type

Journal Articles	202
Reports - Research	158
Reports - Evaluative	75
Opinion Papers	52
Speeches/Meeting Papers	44
Reports - Descriptive	43
Information Analyses	24
Guides - Non-Classroom	17
Collected Works - Proceedings	9
Tests/Questionnaires	9
Books	8
Guides - Classroom - Teacher	6
Reference Materials -…	5
Dissertations/Theses -…	4
Guides - General	4
Reports - General	4
Collected Works - General	3
ERIC Publications	2
Numerical/Quantitative Data	2
Collected Works - Serials	1
Dissertations/Theses -…	1
ERIC Digests in Full Text	1
Legal/Legislative/Regulatory…	1
Reference Materials -…	1

Assessments and Surveys

National Assessment of…	5
ACT Assessment	3
Cornell Critical Thinking Test	2
Self Directed Search	2
Watson Glaser Critical…	2
Woodcock Johnson Tests of…	2
Acculturation Rating Scale…	1
Adjective Check List	1
Advanced Placement…	1
California Psychological…	1
Child Abuse Potential…	1
Clinical Evaluation of…	1
Continuous Performance Test	1
Flanders System of…	1
Georgia Criterion Referenced…	1
Graduate Management Admission…	1
Holland Vocational Preference…	1
Lorge Thorndike Intelligence…	1
Maslach Burnout Inventory	1
Matching Familiar Figures Test	1
Measures of Academic Progress	1
National Teacher Examinations	1
Pediatric Evaluation of…	1
SAT (College Admission Test)	1
Sequential Tests of…	1