Writing is an essential and indispensable component of both classroom-based and large-scale standardized assessments, as it can examine students' integrated language ability efficiently. However, the accuracy and fairness of writing assessment have long been problematic owing to the subjectivity of rating: different raters may assign markedly different scores to the same essay, which threatens the reliability, validity, and ultimately the fairness of the assessment. Eckes (2008) pointed out that rater variability is one of the biggest challenges faced by language assessment researchers, and existing studies have identified rater background as one of the most important sources of variance, causing writing scores to deviate from the "true score".

Within the framework of generalizability theory, this study examined the impact of rater background on the reliability and validity of writing assessment in a college English final examination. The six raters engaged in the study were classified into groups according to gender, educational background, and writing rating experience; each background facet comprised two groups of three raters. The writing scores assigned by the different rater groups were then compared to determine whether significant differences existed. The writing samples were 60 compositions collected from the English final test in June 2017, all written by non-English-major freshmen. Each rater scored the compositions independently and holistically on a 15-point scale (1-15), and the final scores were analyzed in EXCEL and GENOVA.

Data analyses show that, under the current six-rater design, the overall reliability and validity of the writing assessment were relatively low. Pairwise comparisons suggest that male raters were less consistent than female raters during rating, although the two groups did not differ noticeably in rating reliability, convergent validity, or discriminant validity. Raters without an educational background in language testing showed much lower consistency, reliability, convergent validity, and discriminant validity than raters with such a background, and inexperienced raters were less consistent than experienced raters, with lower reliability, convergent validity, and discriminant validity. Finally, two raters were selected for semi-structured interviews; the results indicate that an educational background in language testing and rating experience shape raters' rating beliefs and behaviors to a certain extent, thereby influencing the final writing scores.

The findings reveal that raters' gender had no influence on the quality of writing rating in the final test, whereas their language testing educational background and rating experience did. Examining the sources of rater bias is therefore essential to investigating reliability and validity in writing assessment: once the underlying sources of rater bias are understood, effective measures can be taken to improve the reliability, validity, and ultimately the fairness of a writing assessment.
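As an illustration only, the sketch below shows how the variance components and the relative (Eρ²) and absolute (Φ) coefficients for a fully crossed persons × raters (p × r) G-study, the design described above, can be estimated. The study's actual analyses were run in EXCEL and GENOVA; this Python function, its name, and the fabricated demo scores are assumptions for illustration, not a reproduction of the study's computations.

```python
import numpy as np

def g_study_p_x_r(scores, n_raters_d=None):
    """Estimate variance components and G/Phi coefficients for a
    persons x raters (p x r) crossed design with one score per cell.

    scores     : 2-D array, shape (n_persons, n_raters)
    n_raters_d : number of raters assumed in the D-study
                 (defaults to the number of raters actually used)
    """
    X = np.asarray(scores, dtype=float)
    n_p, n_r = X.shape
    n_raters_d = n_raters_d or n_r

    grand = X.mean()
    person_means = X.mean(axis=1)
    rater_means = X.mean(axis=0)

    # Sums of squares for the two-way ANOVA without replication
    ss_p = n_r * np.sum((person_means - grand) ** 2)
    ss_r = n_p * np.sum((rater_means - grand) ** 2)
    ss_total = np.sum((X - grand) ** 2)
    ss_pr = ss_total - ss_p - ss_r      # residual: p x r interaction + error

    ms_p = ss_p / (n_p - 1)
    ms_r = ss_r / (n_r - 1)
    ms_pr = ss_pr / ((n_p - 1) * (n_r - 1))

    # Expected-mean-square solutions for the variance components
    var_pr = ms_pr
    var_p = max((ms_p - ms_pr) / n_r, 0.0)
    var_r = max((ms_r - ms_pr) / n_p, 0.0)

    # D-study coefficients for n_raters_d raters
    g_coef = var_p / (var_p + var_pr / n_raters_d)              # relative (norm-referenced)
    phi_coef = var_p / (var_p + (var_r + var_pr) / n_raters_d)  # absolute (criterion-referenced)
    return {"var_p": var_p, "var_r": var_r, "var_pr_e": var_pr,
            "E_rho2": g_coef, "Phi": phi_coef}

# Hypothetical example: 5 essays rated by 3 raters on a 1-15 scale
demo = [[12, 11, 13], [8, 9, 7], [10, 10, 11], [14, 12, 13], [6, 7, 6]]
print(g_study_p_x_r(demo, n_raters_d=6))
```

In such a sketch, a large rater and residual variance relative to the person variance would translate into the low generalizability (reliability) coefficients reported for the six-rater design above.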