Literary Cluster Analysis

I: Introduction

My PhD research will involve arguing that there has been a resurgence of modernist aesthetics in the novels of a number of contemporary authors. These authors are Anne Enright, Will Self, Eimear McBride and Sara Baume. All these writers have, at various public events and in the course of many interviews, given very different accounts of their specific relation to modernism, and even if the definition of modernism wasn’t totally overdetermined, we could spend the rest of our lives defining the ways in which their writing engages, or does not engage, with the modernist canon. Indeed, if I have my way, this is what I will spend a substantial portion of my life doing.

It is not in the spirit of reaching a methodology of greater objectivity that I propose we analyse these texts through digital methods; having begun my education in statistical and quantitative methodologies in September of last year, I can tell you that these really afford us no *better* a view of any text than just reading it would, but fortunately I intend to do that too.

This cluster dendrogram was generated in R, and owes its existence to Matthew Jockers’ book Text Analysis with R for Students of Literature, from which I developed a substantial portion of the code that creates the output above.

What the code is attentive to is the words that these authors use the most. When analysing literature qualitatively, we tend to have a magpie sensibility, zoning in on words which produce more effects or stand out in contrast to the literary matter which surrounds them. As such, the ways in which a writer uses the words ‘the’, ‘an’, ‘a’, or ‘this’ tend to pass us by, but they may be far more indicative of a writer’s style, at least in the terms that a computer is attentive to; sentences that are ‘pretty’ are generally statistically insignificant.

II: Methodology

Every corpus that you can see in the above image was scanned into R, and then run through code which counted the number of times every word was used in the text. The resulting figure is called the word’s frequency, and was then converted to a relative frequency by dividing it by the total number of words in the corpus and multiplying the result by 100. Every word with a relative frequency above a certain threshold was put into a matrix, and a function was used to cluster the corpora based on the similarity of the figures they contained, according to a Euclidean metric I don’t fully understand.
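As a rough illustration of the steps just described, here is a minimal sketch in Python rather than R. It is not the code used for the dendrogram: the two toy "corpora", the three feature words and the simple tokeniser are all assumptions made for the example. It shows how a relative frequency is computed as a percentage, and how a Euclidean distance between two rows of such figures is just the square root of the summed squared differences.

```python
import math
import re
from collections import Counter

def relative_frequencies(text):
    """Return each word's share of the text as a percentage."""
    # Assumed tokeniser: lower-case runs of letters and apostrophes.
    words = re.findall(r"[a-z']+", text.lower())
    counts = Counter(words)
    total = len(words)
    return {word: 100 * n / total for word, n in counts.items()}

def euclidean(row_a, row_b):
    """Straight-line distance between two equal-length rows of figures."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(row_a, row_b)))

# Toy stand-ins for the corpora (hypothetical, not the real texts).
corpora = {
    "author_one": "the cat sat on the mat and the dog sat by the door",
    "author_two": "a cat and a dog and a bird and a fish",
}
features = ["the", "a", "and"]  # hypothetical feature words

# One row per corpus, one column per feature word.
matrix = {
    name: [relative_frequencies(text).get(word, 0.0) for word in features]
    for name, text in corpora.items()
}

# The smaller this number, the more alike the two rows are.
distance = euclidean(matrix["author_one"], matrix["author_two"])
print(round(distance, 2))
```

With 21 corpora instead of two, the same distance would be computed for every pair of rows, and those pairwise distances are what a clustering function works from.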

The final matrix was 21 X 57, and compared these 21 corpora on the basis of their relative usage of the words ‘a’, ‘all’, ‘an’, ‘and’, ‘are’, ‘as’, ‘at’, ‘be’, ‘but’, ‘by’, ‘for’, ‘from’, ‘had’, ‘have’, ‘he’, ‘her’, ‘him’, ‘his’, ‘I’, ‘if’, ‘in’, ‘is’, ‘it’, ‘like’, ‘me’, ‘my’, ‘no’, ‘not’, ‘now’, ‘of’, ‘on’, ‘one’, ‘or’, ‘out’, ‘said’, ‘she’, ‘so’, ‘that’, ‘the’, ‘them’, ‘then’, ‘there’, ‘they’, ‘this’, ‘to’, ‘up’, ‘was’, ‘we’, ‘were’, ‘what’, ‘when’, ‘which’, ‘with’, ‘would’, and ‘you’.

Anyway, now we can read the dendrogram.

III: Interpretation

Speaking about the dendrogram in broad terms can be difficult for precisely the reason that I indicated above; quantitative and qualitative methodologies for text analysis are totally opposed to one another. What is obvious, though, is that Eimear McBride and Gertrude Stein are extreme outliers, comparable only to each other. This is in one way unsurprising, because of their brutish, repetitive styles, and in other ways very surprising, because McBride is on record as dismissing Stein’s work for being ‘too navel-gaze-y.’

Jorge Luis Borges and Marcel Proust have branched off in their own direction, as has Sara Baume, which I’m not quite sure what to make of. Franz Kafka, Ernest Hemingway and William Faulkner have formed their own nexus. More comprehensible is the Anne Enright, Katherine Mansfield, D.H. Lawrence, Elizabeth Bowen, F. Scott Fitzgerald and Virginia Woolf cluster; one could make admittedly sweeping judgements about how this could be said to be modernism’s extreme centre, in which the radical experimentalism of its more revanchiste wing was fused rather harmoniously with nineteenth-century social realism, producing a kind of indirect discourse at which I think each of these authors excels.

These revanchistes are well represented in the dendrogram’s right wing, with Flann O’Brien, James Joyce, Samuel Beckett and Djuna Barnes having clustered together, though I am not quite sure what to make of Ford Madox Ford/Joseph Conrad’s showing at all, being unfamiliar with their work.

IV: Conclusion

The basic rule in interpreting dendrograms is that the lower down two ‘leaves’ join, the more similar they can be said to be. Therefore, Anne Enright and Will Self are the contemporary modernists most closely aligned with their forebears, if indeed forebears they can be said to be. It would be harder, from a quantitative perspective, to align Sara Baume with this trend in a straightforward manner, and McBride only seems to correlate with Stein because of how inalienably strange their respective prose styles are.
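To make the reading rule concrete, here is a small Python sketch of agglomerative clustering that records the height at which each pair of branches joins. It assumes complete linkage over Euclidean distances (the defaults of R’s hclust and dist, though I have not confirmed here that the defaults were what produced the dendrogram), and the four rows A to D are invented figures, not data from the corpora.

```python
import math
from itertools import combinations

def euclidean(a, b):
    """Straight-line distance between two equal-length rows."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def complete_linkage(rows):
    """Repeatedly merge the two closest clusters, recording each merge height.

    Under complete linkage, the distance between two clusters is the
    largest distance between any member of one and any member of the other.
    """
    def gap(pair):
        left, right = pair
        return max(euclidean(rows[a], rows[b]) for a in left for b in right)

    clusters = [(name,) for name in rows]
    merges = []
    while len(clusters) > 1:
        left, right = min(combinations(clusters, 2), key=gap)
        merges.append((left, right, gap((left, right))))
        clusters = [c for c in clusters if c not in (left, right)]
        clusters.append(left + right)
    return merges

# Invented rows of relative frequencies for four hypothetical authors.
rows = {
    "A": [6.0, 3.0],
    "B": [6.1, 3.2],
    "C": [4.0, 1.0],
    "D": [9.0, 7.0],
}

merges = complete_linkage(rows)
for left, right, height in merges:
    print(left, "+", right, "join at height", round(height, 2))
```

In this toy case A and B join first and lowest, so they are the most similar pair, while D only joins the rest at the very top, which is the position an outlier occupies.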

The primary point to take away here, if there is one, is that more investigations are required. The analysis is hardly unproblematic. For one, the corpus sizes vary enormously. Borges’ corpus is around 46 thousand words, whereas Proust’s reaches somewhere around 1.2 million. In one way, the results are encouraging: Borges and Barnes, two authors with only one text each in their corpora, aren’t prevented from being compared to novelists with serious word counts. In another way, it is pretty well impossible to derive literary measurements from texts without taking their length into account. The next stage of the analysis will probably involve breaking the corpora up into units of 50 thousand words, so that the results for individual novels can be compared.
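Breaking a corpus into fixed-size units might look something like the following Python sketch. The function name and the whitespace tokenisation are assumptions for illustration, and the example uses a tiny chunk size so the behaviour is easy to see; the planned size of 50 thousand words is the default.

```python
def chunk_words(text, size=50_000):
    """Split a text into consecutive lists of at most `size` words."""
    words = text.split()  # assumed tokenisation: split on whitespace
    return [words[i:i + size] for i in range(0, len(words), size)]

# Toy example with a chunk size of 4 words instead of 50,000.
sample = "one two three four five six seven eight nine ten"
chunks = chunk_words(sample, size=4)
print([len(chunk) for chunk in chunks])
```

Note that the final chunk will usually be shorter than the rest. Whether to keep it, drop it, or fold it into the previous chunk is a decision that would affect the relative frequencies, so it would need to be made deliberately.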

Re-reading Eimear McBride’s ‘A Girl is a Half-Formed Thing’

A book that I’m looking forward to reading, that doesn’t exist yet, is an academic account of how Irish contemporary fiction went, in such a short space of time, from social realism to the precociously sentenced art writing with dissociative narrators that now composes the Irish literary milieu. It’s the sort of thing that was probably brewing for a long time, as these trends tend to be, but I first became aware of it when Eimear McBride’s A Girl is a Half-Formed Thing was published in 2013. It caused a bit of a stir in the literary press at the time, for its supposed uncompromising experimentalism and its fraught, J.K. Rowling-esque publication history. Critics compared it to Marcel Proust or Samuel Beckett, but I don’t think there was a single review that didn’t mention James Joyce.

In the works of Sara Baume, Joanna Walsh or Claire-Louise Bennett, there are certainly comparisons to be made along these lines, but I think McBride is the novelist of the current generation who is suffering most egregiously under these comparisons. This leads to a kind of distortion that McBride has spoken about recently, saying that it’s ‘a way of not being seen’. Claire Lowdon, writing on McBride’s prose style in Areté, has used the Joyce comparisons as a way of demeaning the novel’s experimental qualities, saying that they are ‘redundant’ and ‘artificial’:

Having invoked Joyce, Joyce has to be McBride’s standard. She has taken all the difficulty and none of the brilliance.

Lowdon’s reading is important and thorough, but I have problems with it. The most significant is that I think it’s nonsensical to say that just because a work is in some way formally indebted to Joyce, it has to be 1) as good, 2) as innovative and 3) as good and as innovative in exactly the same ways. I think it’s a very strange point to make that we should benchmark a writer relative to their influences, particularly when this is a comparison furthered more by the laziness of critics than something that McBride has taken upon herself. It’s also inadequate to assume McBride and Joyce’s modernisms are coterminous; I happen to think that they’re rather distinct in a number of significant ways.

Firstly, it’s clear that A Girl is more formally aligned with the Wake than with Ulysses, but taken relative to the former, A Girl manifests far less attention to the materiality of language. In A Girl, there are fewer puns, fewer references, fewer leitmotifs. It’s also possible to make sense of A Girl without reference to other works. But it’s a mistake to regard this as McBride’s failure to live up to her twentieth-century modernist aesthetics. An example from the novel’s opening that Lowdon cites reads as follows:

For you. You’ll soon. You’ll give her name. In the stitches of her skin she’ll wear your say. Mammy me? Yes you. Bounce the bed I’d say. I’d say that’s what you did. Then lay you down. They cut you round. Wait and hour and day.

‘Wait and hour and day’ carries with it a vague association with the phrase ‘a year and a day’, but it doesn’t strictly make sense in that context; there’s no clear reason for the semantic distortion. But there’s also no requirement that there be one, nor that it add up to some enormous mythic framework in the same way that the Wake does. I think that once we approach the novel from this position, one which takes account of McBride’s actual concerns, we’ll be able to come to a more sophisticated understanding that doesn’t amount to downgrading her because of her perceived inadequacy in relation to Joyce.

By her own admission McBride retains an interest in nineteenth-century novels with less self-consciousness about their language or processes of meaning-making. She has cited the work of the Russian novelist Fyodor Dostoevsky as significant, particularly as an example of proto-modernism, or modernism in a nascent stage of its development, wherein human intersubjectivity was beginning to make itself known within the novel while the tenets of realistic fiction were still trying to accommodate it. Although The Lesser Bohemians is not the novel under discussion, it’s important to note the way in which it demonstrates this interplay. Within the context of what has been referred to by the author as a ‘modernist monologue’ there is a very sensationalistic narrative in which a character lays out their life story in a very direct and straightforward manner, in the same way that you might find extended and directly rendered narratives nested within nineteenth-century novels. McBride has said that this is a very deliberate formal mechanic which is pertinent to the text’s thematic concerns, as it is a novel about relating to another person in spite of one’s traumatic past:

In the end you tell a person and you have to use the words that they’ll understand.

What makes McBride’s modernism distinct, then, is the centrality it gives to the conveying of narrative information, deploying it as a means of bringing the reader closer to

physical experience, to write about the female experience…the reader can partake in the experience.

McBride has said that the language of A Girl was written in a way that would create a physical experience for the reader, an immediacy on the page that is reminiscent of theatre. She’s expressed frustration at the content of many of her reviews, which have emphasised the quality of the language at the expense of the novel’s content, which she regards as very significant. This stands in contrast to the tradition of the Wake or other modernist works famed for their unintelligibility, such as Gertrude Stein’s The Making of Americans: Being a History of a Family’s Progress, a novel that she has spoken about dismissively for being ‘too navel-gaze-y.’

This stated interest in what the book is ‘about’, together with a reader-centric ethic, is, I think, at least a partial reversal of expectations within the modernist tradition. McBride’s modernism is therefore conceptualised not as a constructed textual estrangement from reality, but as an attempt to bring it closer, to a dwelling-place of authentic being. Not that it’s likely to close off such comparisons in the future.