Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 11 |
Since 2016 (last 10 years) | 32 |
Since 2006 (last 20 years) | 80 |
Descriptor
Correlation | 117 |
Scoring | 117 |
Test Validity | 61 |
Test Reliability | 36 |
Validity | 36 |
Computer Assisted Testing | 29 |
Scores | 29 |
Statistical Analysis | 27 |
Comparative Analysis | 25 |
Foreign Countries | 23 |
Factor Analysis | 20 |
More ▼ |
Source
Author
Attali, Yigal | 5 |
Anna-Maria Fall | 2 |
Beula M. Magimairaj | 2 |
Breland, Hunter M. | 2 |
Forthmann, Boris | 2 |
Greene, John F. | 2 |
Greg Roberts | 2 |
Kyllonen, Patrick | 2 |
Liu, Ou Lydia | 2 |
Philip Capin | 2 |
Ramineni, Chaitanya | 2 |
More ▼ |
Publication Type
Education Level
Higher Education | 27 |
Postsecondary Education | 18 |
Secondary Education | 12 |
High Schools | 7 |
Elementary Education | 6 |
Elementary Secondary Education | 6 |
Early Childhood Education | 3 |
Grade 3 | 3 |
Grade 8 | 3 |
Kindergarten | 3 |
Grade 2 | 2 |
More ▼ |
Audience
Researchers | 3 |
Practitioners | 1 |
Location
Turkey | 6 |
China | 3 |
Japan | 3 |
United States | 3 |
United Kingdom (England) | 2 |
Australia | 1 |
California | 1 |
Canada | 1 |
Chile | 1 |
China (Shanghai) | 1 |
Colombia | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Shermis, Mark D. – Journal of Educational Measurement, 2022
One of the challenges of discussing validity arguments for machine scoring of essays centers on the absence of a commonly held definition and theory of good writing. At best, the algorithms attempt to measure select attributes of writing and calibrate them against human ratings with the goal of accurate prediction of scores for new essays.…
Descriptors: Scoring, Essays, Validity, Writing Evaluation
Conti, Gary J. – Journal of Education and Learning, 2023
The use of personality inventories has been limited because of their cost and the length. To overcome these limitations, this study created the Personality Identity Estimator (PIE), an easy-to-use inventory to estimate personality types that can be used at no cost. PIE is a categorical inventory containing 12 items with 3 items for each of the 4…
Descriptors: Personality Measures, Personality Traits, Validity, Reliability
Saban-Dülger, Nur Seda; Turan, Figen; Özcebe, Esra – Journal of Speech, Language, and Hearing Research, 2022
Purpose: Language sampling analysis (LSA) plays an important role in evaluating language skills; hence, the study aimed to develop new assessment measures for the LSA in Turkish as alternatives to mean length of utterance (MLU) and the Language Assessment, Remediation and Screening Procedure. With this aim, Developmental Sentence Scoring (DSS) and…
Descriptors: Syntax, Turkish, Speech Communication, Correlation
Forthmann, Boris; Oyebade, Oluwatosin; Ojo, Adebusola; Günther, Fritz; Holling, Heinz – Journal of Creative Behavior, 2019
Scoring divergent-thinking response sets has always been challenging because such responses are not only open-ended in terms of number of ideas, but each idea may also be expressed by a varying number of concepts and, thus, by a varying number of words (elaboration). While many current studies have attempted to score the semantic distance in…
Descriptors: Semantics, Creative Thinking, Simulation, Correlation
Selcuk Acar; Denis Dumas; Peter Organisciak; Kelly Berthiaume – Grantee Submission, 2024
Creativity is highly valued in both education and the workforce, but assessing and developing creativity can be difficult without psychometrically robust and affordable tools. The open-ended nature of creativity assessments has made them difficult to score, expensive, often imprecise, and therefore impractical for school- or district-wide use. To…
Descriptors: Thinking Skills, Elementary School Students, Artificial Intelligence, Measurement Techniques
Beaty, Roger E.; Johnson, Dan R.; Zeitlen, Daniel C.; Forthmann, Boris – Creativity Research Journal, 2022
Semantic distance is increasingly used for automated scoring of originality on divergent thinking tasks, such as the Alternate Uses Task (AUT). Despite some psychometric support for semantic distance -- including positive correlations with human creativity ratings -- additional work is needed to optimize its reliability and validity, including…
Descriptors: Semantics, Scoring, Creative Thinking, Creativity
LaVoie, Noelle; Parker, James; Legree, Peter J.; Ardison, Sharon; Kilcullen, Robert N. – Educational and Psychological Measurement, 2020
Automated scoring based on Latent Semantic Analysis (LSA) has been successfully used to score essays and constrained short answer responses. Scoring tests that capture open-ended, short answer responses poses some challenges for machine learning approaches. We used LSA techniques to score short answer responses to the Consequences Test, a measure…
Descriptors: Semantics, Evaluators, Essays, Scoring
Dalton, Sarah Grace; Stark, Brielle C.; Fromm, Davida; Apple, Kristen; MacWhinney, Brian; Rensch, Amanda; Rowedder, Madyson – Journal of Speech, Language, and Hearing Research, 2022
Purpose: The aim of this study was to advance the use of structured, monologic discourse analysis by validating an automated scoring procedure for core lexicon (CoreLex) using transcripts. Method: Forty-nine transcripts from persons with aphasia and 48 transcripts from persons with no brain injury were retrieved from the AphasiaBank database. Five…
Descriptors: Validity, Discourse Analysis, Databases, Scoring
Beula M. Magimairaj; Philip Capin; Sandra L. Gillam; Sharon Vaughn; Greg Roberts; Anna-Maria Fall; Ronald B. Gillam – Grantee Submission, 2022
Purpose: Our aim was to evaluate the psychometric properties of the online administered format of the Test of Narrative Language--Second Edition (TNL-2; Gillam & Pearson, 2017), given the importance of assessing children's narrative ability and considerable absence of psychometric studies of spoken language assessments administered online.…
Descriptors: Computer Assisted Testing, Language Tests, Story Telling, Language Impairments
Tavares, Walter; Brydges, Ryan; Myre, Paul; Prpic, Jason; Turner, Linda; Yelle, Richard; Huiskamp, Maud – Advances in Health Sciences Education, 2018
Assessment of clinical competence is complex and inference based. Trustworthy and defensible assessment processes must have favourable evidence of validity, particularly where decisions are considered high stakes. We aimed to organize, collect and interpret validity evidence for a high stakes simulation based assessment strategy for certifying…
Descriptors: Competence, Simulation, Allied Health Personnel, Certification
Wang, Qiao – Education and Information Technologies, 2022
This study searched for open-source semantic similarity tools and evaluated their effectiveness in automated content scoring of fact-based essays written by English-as-a-Foreign-Language (EFL) learners. Fifty writing samples under a fact-based writing task from an academic English course in a Japanese university were collected and a gold standard…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Scoring
Beula M. Magimairaj; Philip Capin; Sandra L. Gillam; Sharon Vaughn; Greg Roberts; Anna-Maria Fall; Ronald B. Gillam – Language, Speech, and Hearing Services in Schools, 2022
Purpose: Our aim was to evaluate the psychometric properties of the online administered format of the Test of Narrative Language--Second Edition (TNL-2; Gillam & Pearson, 2017), given the importance of assessing children's narrative ability and considerable absence of psychometric studies of spoken language assessments administered online.…
Descriptors: Computer Assisted Testing, Language Tests, Story Telling, Language Impairments
Carmichael, Jessica A.; Fraccaro, Rebecca L.; Nordstokke, David W. – Canadian Journal of School Psychology, 2014
Oral language skills are important to consider in school psychology practice, as they are directly tied to many areas of academic functioning. For example, research has demonstrated that oral language skills in early elementary school predict reading comprehension in later grades (Kendeou, van den Broek, White, & Lynch, 2009). With a…
Descriptors: Language Tests, Oral Language, Language Skills, School Psychology
Davis, Larry; Norris, John – ETS Research Report Series, 2021
The elicited imitation task (EIT), in which language learners listen to a series of spoken sentences and repeat each one verbatim, is a commonly used measure of language proficiency in second language acquisition research. The "TOEFL® Essentials"™ test includes an EIT as a holistic measure of speaking proficiency, referred to as the…
Descriptors: Task Analysis, Language Proficiency, Speech Communication, Language Tests
Cohen, Yoav; Levi, Effi; Ben-Simon, Anat – Applied Measurement in Education, 2018
In the current study, two pools of 250 essays, all written as a response to the same prompt, were rated by two groups of raters (14 or 15 raters per group), thereby providing an approximation to the essay's true score. An automated essay scoring (AES) system was trained on the datasets and then scored the essays using a cross-validation scheme. By…
Descriptors: Test Validity, Automation, Scoring, Computer Assisted Testing