What Do We Hear in Depp v. Heard?

As you probably know—whether you want to or not—the jury reached a verdict earlier this summer in the trial between Amber Heard and Johnny Depp. The trial, in the Fairfax County Circuit Court in Virginia, involved defamation and counter-defamation claims by the two actors. Heard published a 2018 op-ed in The Washington Post in which she claimed to be “a public figure representing domestic abuse.” Depp sued her for defamation, she counter-sued, and a seven-week spectacle of celebrity, misogyny, and power followed, in which Depp substantially prevailed.

What does a close listening to Depp v. Heard tell us about this particular trial, as well as about sex and power in the courtroom more generally? 

Depp v. Heard did not just randomly become a media circus. As Joanne Sweeny noted in Slate, the judge made two procedural rulings that led to the ensuing frenzy—and greatly tipped the scales toward the plaintiff. Firstly, the judge allowed cameras in the courtroom to broadcast the proceedings. The Code of Virginia leaves this decision largely up to the court’s discretion, but also stipulates that coverage of “proceedings concerning sexual offenses” is prohibited. Despite the content and high-profile nature of this case, Judge Penney Azcarate decided to proceed with the broadcast. 

Untitled Image by Flickr user SethTippie

Azcarate’s decision is strikingly at odds with the court’s emphasis on silence and decorum. Court order CL-2019-2911 stated, for example, that “Quiet and order shall be maintained at all times. Audible comments of any kind during the court proceedings … will not be tolerated.” In fact, Azcarate interrupted proceedings during trial to tell courtroom spectators to keep their mouths shut. During trial, extraneous noise is heard not just as uncivil but as a threat to impartiality and fairness. However, according to the judge’s logic, this threat is only perceived  within the courtroom. 

This brings us to the second procedural ruling of consequence here. Despite the frenzy enveloping the case, Azcarate decided not to sequester the jury. Jury sequestration involves  the members of the jury being isolated  from public and press during a trial, in order to avoid accidental or deliberate exposure to outside influence or information. Video from the courtroom flooded the internet and, as commentators have argued, likely and unduly influenced the jury, who were not isolated and prevented from accessing TV or social media. As Depp’s legions of supporters raged online, social media effectively became part of his legal team. This  work was done in great part through sound. 

Social media online commentary forensically dissected Heard’s oral testimony, noting changes in her breathing patterns or her speech cadence. Often they would hone in on the fact that she “exhale[d] erratically,” or “can talk so fast,” as seen in this Entertainment Tonight compilation:

The online jury adjudicated on all these vocal elements as proof that she was lying. One internet article described her in audiotape evidence as “cackle[ing] like a witch” and alternating between “laugh[ing] hysterically” and using a “baby voice.” Heard’s detractors took her voice as proof that she was emotionless, robotic, calculated, too well-rehearsed—but also that she was chaotic, nervous, crazy. 

In contrast, commentators described Depp’s voice as “calm,” “calming,” and “soothing,” with Tik Tok users hash-tagging ASMR to audio of him. One fan even posted a ninety-minute ASMR video of his testimony. Multiple Twitter users claimed that “you can hear the pain” in his voice, from an audiotape admitted during trial. At other times, he is applauded for “giggling” and laughing during the trial, with fans hearing it as evidence of his authenticity and “kind soul.” One YouTube commentator, Grandma WHOa, writes that they wish he would record an audiobook so they could “listen to his calming, sexy soothing man voice.” 

So far, so predictable. These are well-established, recognizable patterns about how we hear men’s vs women’s voices in public life—e.g. critiques of Hilary Clinton’s shrill, whiny voice. But listening in to the trial also reveals that this isn’t just a case of online fan culture on overdrive. Instead, it shows how broader social dynamics around gender and power don’t just create outside noise, but are built into formal legal practice within the courtroom.

Much of the conflict here follows a common pattern in defamation cases involving sexual violence claims, with questions around who gets to be a victim (see in my forthcoming piece in HAU: Journal of Ethnographic Theory titled “The Tone of Justice: Voicing the Perpetrator-as-Victim in Sexual Assault Cases”). Depp claimed to have suffered through the defamatory statement and through a longer history of abuse by Heard. His fans framed him as a hero and a victim, using the  social media hashtag #HeardIsAnAbuser. On the other hand, they refused to believe Heard’s claims of having suffered abuse. This determination was based at least in part on Heard’s vocal performance and courtroom testimony, with detractors hearing duplicity in her exhalations, her rapid pace, the occasional firmness and confidence of her tone. As one Depp supporter commented on a video of Heard’s testimony,  “There’s no way a victim sounds like this.” 

Yet in a key strategic move, Depp’s lawyers chose to make Heard sound precisely as sexual assault victims often do during trial. Seeking to dismantle her credibility, they looked to the toolkit of how to deal with a victim in court, mobilizing a well-worn set of legal techniques used to interrogate survivors of sexual violence. In one cross-examination, for example, the plaintiff’s counsel declares that Heard’s “lies have been exposed to the world multiple times.” This claim is then manifested through a series of vocal disciplinary tactics to undermine Heard’s testimony and depict her as a false witness.

For instance, the lawyer, Camille Vasquez, repeatedly employs a common interrogation technique of speaking over and cutting off Heard as she is replying to a question. As legal scholars and sociologists have shown, such techniques are often used in sexual assault cases to intimidate and shape perceptions of the complainant. In a pioneering study on courtroom talk during rape trials, Gregory Matoesian, in Reproducing rape: Domination through talk in the courtroom (1993) describes how lawyers reproduce patriarchal relations of dominance and subordination by “usurping” the witness’ ability to respond (186). As he notes, questions—wielded like weapons of attack by skillful lawyers—are more powerful than answers. 

Vocal technique and dynamics are key here. In Vasquez’s cross-examinations, she repeatedly raises her voice to interrupt Heard, disciplining her before the jury and spectators. She laughs at her testimony and infantilizes Heard, at times speaking to her in calm tones before quickly shifting to a harsher timbre. At one point, Vasquez snaps her notes shut and walks back to her seat while Heard is still answering her question. Heard is forced into abrupt silence. Unable to respond to the question she was asked, she audibly loses control of the narrative being spun. Vasquez also frequently speaks over her and directly to the judge, objecting that Heard is being non-responsive. The lawyer performs for the judge and jury her refusal to listen to Heard. 

At other moments, Vasquez’s voice and affect telegraph exasperation, as she audibly sighs while Heard attempts to answer a question. As Heard and Vasquez go back-and-forth over a line of questioning, Vasquez’s voice bristles with irritation as she speaks in clipped tones, with sharp inflection at the end of each line: “Yes?” “Right?” “Yes or no?” These interjections add an aural layer of interpretation to Heard’s testimony in real-time, guiding the jury to hear the witness as evasive and therefore unreliable. Vasquez’s expressions are all part of a careful vocal strategy, implicitly saying to the jury, “Can you believe this woman?” 

Screenshot from NBC Today video, “Amber Heard Breaks Silence: I Don’t Blame The Jury”

Of course, the answer is no. Jessica Winter, writing in The New Yorker, points out that Heard lost in part because of her “tearless crying,” the fact that she appeared insincere. Winter acknowledges that successful testimony is about “affect and presentation”, a reality that is no secret. In fact, jury instructions in Depp v. Heard clearly state that determinations of witness credibility are based in part on witnesses’ “appearance and manner.” Jurors must use their “common sense” to “determine which witnesses are more believable.” 

But how is “common sense” established? Listening closely to this trial reminds us that such understandings are constructed and regulated through sound as well as through determinants of “appearance and manner,” both in and out of the courtroom. Vasquez’s performance, Heard’s subordinated testimony, and the commentary of millions of avid consumers underline that Heard and Depp sound to many people exactly as common sense and conventional norms would dictate. 

A woman claiming abuse and assault at the hands of a more powerful man is always subject to patriarchal ways of listening, even if she is rich, famous, straight, and white. These ways of listening are contradictory. Research shows that “masculine” voices are heard as more authoritative and dominant, while women are often heard as weak, uncertain, lacking confidence. The public ear hears other racialized and gendered voices through similar power inequities, including queer, nonbinary, and LGBT voices or voices of people of color. In the context of sexual assault adjudication, however, Heather Hlavka and Sameena Mulla show in their Law & Society Review article “That’s How She Talks”: Animating Text Message Evidence in the Sexual Assault Trial” “that a confident voice and calm performance can work against a victim-witness in court, by suggesting that she is not passive or meek enough to be a ‘real’ victim.” On the other hand, they note that a victim-witness who cries on the stand may give the impression of performing or acting. Lawyers audibly manipulate these perceptions, as the examples here show, and men (particularly heteronormative, white men in positions of power) reap huge benefits from them.  

Many observers of Depp v. Heard have noted the toxic social media sludge around the case, as well as the danger that the verdict poses to survivors of domestic abuse and sexual assault. But listening closely to the proceedings shows us that these outcomes aren’t random and aren’t just part of informal processes like trial by Tik Tok. 

Instead, formal court proceedings manipulate and mobilize social scripts around survivors of sexual assault and domestic violence, and around women and marginalized others, to reach their outcomes. We can hear how this strategy plays out through sound and voice, from sighing and interrupting to laughter and silence. The jury instructions in Depp v. Heard state that “Our system of law does not permit jurors to be governed by sympathy, prejudice, or public opinion.” But despite claims that the legal system is based on objectivity and impartiality, we can hear that the law never exists in a bubble – and lawyers often and successfully rely on this very fact. 

Featured image: “Courtroom” by Flickr user Karen Neoh, CC BY 2.0

Nomi Dave is a former lawyer, interdisciplinary researcher, and co-director of the Sound Justice Lab at the University of Virginia, where she is Associate Professor of Music. She is currently co-writing  and co-directing a documentary film, Big Mouth, on a defamation lawsuit connected to a sexual violence case in Guinea.

tape reel

REWIND!…If you liked this post, you may also dig:

“People’s lives are at stake”: A conversation about Law, Listening, and Sound between James Parker and Lawrence English—Lawrence English and James Parker

Vocal Gender and the Gendered Soundscape: At the Intersection of Gender Studies and Sound Studies—Christine Ehrick

Or Does it Explode?: Sounding Out the U.S. Metropolis in Hansberry’s A Raisin in the Sun—Liana Silva

What Do We Hear in Depp v. Heard?

As you probably know—whether you want to or not—the jury reached a verdict earlier this summer in the trial between Amber Heard and Johnny Depp. The trial, in the Fairfax County Circuit Court in Virginia, involved defamation and counter-defamation claims by the two actors. Heard published a 2018 op-ed in The Washington Post in which she claimed to be “a public figure representing domestic abuse.” Depp sued her for defamation, she counter-sued, and a seven-week spectacle of celebrity, misogyny, and power followed, in which Depp substantially prevailed.

What does a close listening to Depp v. Heard tell us about this particular trial, as well as about sex and power in the courtroom more generally? 

Depp v. Heard did not just randomly become a media circus. As Joanne Sweeny noted in Slate, the judge made two procedural rulings that led to the ensuing frenzy—and greatly tipped the scales toward the plaintiff. Firstly, the judge allowed cameras in the courtroom to broadcast the proceedings. The Code of Virginia leaves this decision largely up to the court’s discretion, but also stipulates that coverage of “proceedings concerning sexual offenses” is prohibited. Despite the content and high-profile nature of this case, Judge Penney Azcarate decided to proceed with the broadcast. 

Untitled Image by Flickr user SethTippie

Azcarate’s decision is strikingly at odds with the court’s emphasis on silence and decorum. Court order CL-2019-2911 stated, for example, that “Quiet and order shall be maintained at all times. Audible comments of any kind during the court proceedings … will not be tolerated.” In fact, Azcarate interrupted proceedings during trial to tell courtroom spectators to keep their mouths shut. During trial, extraneous noise is heard not just as uncivil but as a threat to impartiality and fairness. However, according to the judge’s logic, this threat is only perceived  within the courtroom. 

This brings us to the second procedural ruling of consequence here. Despite the frenzy enveloping the case, Azcarate decided not to sequester the jury. Jury sequestration involves  the members of the jury being isolated  from public and press during a trial, in order to avoid accidental or deliberate exposure to outside influence or information. Video from the courtroom flooded the internet and, as commentators have argued, likely and unduly influenced the jury, who were not isolated and prevented from accessing TV or social media. As Depp’s legions of supporters raged online, social media effectively became part of his legal team. This  work was done in great part through sound. 

Social media online commentary forensically dissected Heard’s oral testimony, noting changes in her breathing patterns or her speech cadence. Often they would hone in on the fact that she “exhale[d] erratically,” or “can talk so fast,” as seen in this Entertainment Tonight compilation:

The online jury adjudicated on all these vocal elements as proof that she was lying. One internet article described her in audiotape evidence as “cackle[ing] like a witch” and alternating between “laugh[ing] hysterically” and using a “baby voice.” Heard’s detractors took her voice as proof that she was emotionless, robotic, calculated, too well-rehearsed—but also that she was chaotic, nervous, crazy. 

In contrast, commentators described Depp’s voice as “calm,” “calming,” and “soothing,” with Tik Tok users hash-tagging ASMR to audio of him. One fan even posted a ninety-minute ASMR video of his testimony. Multiple Twitter users claimed that “you can hear the pain” in his voice, from an audiotape admitted during trial. At other times, he is applauded for “giggling” and laughing during the trial, with fans hearing it as evidence of his authenticity and “kind soul.” One YouTube commentator, Grandma WHOa, writes that they wish he would record an audiobook so they could “listen to his calming, sexy soothing man voice.” 

So far, so predictable. These are well-established, recognizable patterns about how we hear men’s vs women’s voices in public life—e.g. critiques of Hilary Clinton’s shrill, whiny voice. But listening in to the trial also reveals that this isn’t just a case of online fan culture on overdrive. Instead, it shows how broader social dynamics around gender and power don’t just create outside noise, but are built into formal legal practice within the courtroom.

Much of the conflict here follows a common pattern in defamation cases involving sexual violence claims, with questions around who gets to be a victim (see in my forthcoming piece in HAU: Journal of Ethnographic Theory titled “The Tone of Justice: Voicing the Perpetrator-as-Victim in Sexual Assault Cases”). Depp claimed to have suffered through the defamatory statement and through a longer history of abuse by Heard. His fans framed him as a hero and a victim, using the  social media hashtag #HeardIsAnAbuser. On the other hand, they refused to believe Heard’s claims of having suffered abuse. This determination was based at least in part on Heard’s vocal performance and courtroom testimony, with detractors hearing duplicity in her exhalations, her rapid pace, the occasional firmness and confidence of her tone. As one Depp supporter commented on a video of Heard’s testimony,  “There’s no way a victim sounds like this.” 

Yet in a key strategic move, Depp’s lawyers chose to make Heard sound precisely as sexual assault victims often do during trial. Seeking to dismantle her credibility, they looked to the toolkit of how to deal with a victim in court, mobilizing a well-worn set of legal techniques used to interrogate survivors of sexual violence. In one cross-examination, for example, the plaintiff’s counsel declares that Heard’s “lies have been exposed to the world multiple times.” This claim is then manifested through a series of vocal disciplinary tactics to undermine Heard’s testimony and depict her as a false witness.

For instance, the lawyer, Camille Vasquez, repeatedly employs a common interrogation technique of speaking over and cutting off Heard as she is replying to a question. As legal scholars and sociologists have shown, such techniques are often used in sexual assault cases to intimidate and shape perceptions of the complainant. In a pioneering study on courtroom talk during rape trials, Gregory Matoesian, in Reproducing rape: Domination through talk in the courtroom (1993) describes how lawyers reproduce patriarchal relations of dominance and subordination by “usurping” the witness’ ability to respond (186). As he notes, questions—wielded like weapons of attack by skillful lawyers—are more powerful than answers. 

Vocal technique and dynamics are key here. In Vasquez’s cross-examinations, she repeatedly raises her voice to interrupt Heard, disciplining her before the jury and spectators. She laughs at her testimony and infantilizes Heard, at times speaking to her in calm tones before quickly shifting to a harsher timbre. At one point, Vasquez snaps her notes shut and walks back to her seat while Heard is still answering her question. Heard is forced into abrupt silence. Unable to respond to the question she was asked, she audibly loses control of the narrative being spun. Vasquez also frequently speaks over her and directly to the judge, objecting that Heard is being non-responsive. The lawyer performs for the judge and jury her refusal to listen to Heard. 

At other moments, Vasquez’s voice and affect telegraph exasperation, as she audibly sighs while Heard attempts to answer a question. As Heard and Vasquez go back-and-forth over a line of questioning, Vasquez’s voice bristles with irritation as she speaks in clipped tones, with sharp inflection at the end of each line: “Yes?” “Right?” “Yes or no?” These interjections add an aural layer of interpretation to Heard’s testimony in real-time, guiding the jury to hear the witness as evasive and therefore unreliable. Vasquez’s expressions are all part of a careful vocal strategy, implicitly saying to the jury, “Can you believe this woman?” 

Screenshot from NBC Today video, “Amber Heard Breaks Silence: I Don’t Blame The Jury”

Of course, the answer is no. Jessica Winter, writing in The New Yorker, points out that Heard lost in part because of her “tearless crying,” the fact that she appeared insincere. Winter acknowledges that successful testimony is about “affect and presentation”, a reality that is no secret. In fact, jury instructions in Depp v. Heard clearly state that determinations of witness credibility are based in part on witnesses’ “appearance and manner.” Jurors must use their “common sense” to “determine which witnesses are more believable.” 

But how is “common sense” established? Listening closely to this trial reminds us that such understandings are constructed and regulated through sound as well as through determinants of “appearance and manner,” both in and out of the courtroom. Vasquez’s performance, Heard’s subordinated testimony, and the commentary of millions of avid consumers underline that Heard and Depp sound to many people exactly as common sense and conventional norms would dictate. 

A woman claiming abuse and assault at the hands of a more powerful man is always subject to patriarchal ways of listening, even if she is rich, famous, straight, and white. These ways of listening are contradictory. Research shows that “masculine” voices are heard as more authoritative and dominant, while women are often heard as weak, uncertain, lacking confidence. The public ear hears other racialized and gendered voices through similar power inequities, including queer, nonbinary, and LGBT voices or voices of people of color. In the context of sexual assault adjudication, however, Heather Hlavka and Sameena Mulla show in their Law & Society Review article “That’s How She Talks”: Animating Text Message Evidence in the Sexual Assault Trial” “that a confident voice and calm performance can work against a victim-witness in court, by suggesting that she is not passive or meek enough to be a ‘real’ victim.” On the other hand, they note that a victim-witness who cries on the stand may give the impression of performing or acting. Lawyers audibly manipulate these perceptions, as the examples here show, and men (particularly heteronormative, white men in positions of power) reap huge benefits from them.  

Many observers of Depp v. Heard have noted the toxic social media sludge around the case, as well as the danger that the verdict poses to survivors of domestic abuse and sexual assault. But listening closely to the proceedings shows us that these outcomes aren’t random and aren’t just part of informal processes like trial by Tik Tok. 

Instead, formal court proceedings manipulate and mobilize social scripts around survivors of sexual assault and domestic violence, and around women and marginalized others, to reach their outcomes. We can hear how this strategy plays out through sound and voice, from sighing and interrupting to laughter and silence. The jury instructions in Depp v. Heard state that “Our system of law does not permit jurors to be governed by sympathy, prejudice, or public opinion.” But despite claims that the legal system is based on objectivity and impartiality, we can hear that the law never exists in a bubble – and lawyers often and successfully rely on this very fact. 

Featured image: “Courtroom” by Flickr user Karen Neoh, CC BY 2.0

Nomi Dave is a former lawyer, interdisciplinary researcher, and co-director of the Sound Justice Lab at the University of Virginia, where she is Associate Professor of Music. She is currently co-writing  and co-directing a documentary film, Big Mouth, on a defamation lawsuit connected to a sexual violence case in Guinea.

tape reel

REWIND!…If you liked this post, you may also dig:

“People’s lives are at stake”: A conversation about Law, Listening, and Sound between James Parker and Lawrence English—Lawrence English and James Parker

Vocal Gender and the Gendered Soundscape: At the Intersection of Gender Studies and Sound Studies—Christine Ehrick

Or Does it Explode?: Sounding Out the U.S. Metropolis in Hansberry’s A Raisin in the Sun—Liana Silva

Voice as Ecology: Voice Donation, Materiality, Identity

I first heard about voice donation while listening to “Being Siri,” an experimental audio piece about Erin Anderson donating her voice to Boston-based voice donation company, VocaliD. Like a digital blood bank of sorts, VocaliD provides a platform for donating one’s voice via digital audio recordings. These recordings are used to help technicians create a custom digital voice for a voiceless individual, providing an alternative to the predominately white, male, mechanical-sounding assistive technologies used by people who cannot vocalize for themselves (think Stephen Hawking). VocaliD manufactures voices that better match a person’s race, gender, ethnicity, age, and unique personality. To me, VocaliD encapsulates the promise, complexity, and problematic nature of our current speech AI landscape and serves as an example of why we need to think critically about sound technologies, even when they appear to be wholly beneficial.  

Given the extreme lack of sonic diversity in vocal assistive technologies, VocaliD provides a critically important service. But a closer look at both the rhetoric used by the organization and the material process involved in voice donation also amplifies the limits of overly simplistic, human-centric conceptions of voice. For instance, VocaliD rhetorically frames their service by persistently linking voice to humanity—to self, authenticity, individuality. Consider the following statements made by Rupal Patel, CEO and founder of VocaliD, in which she emphasizes the need for voice donation technology: 

“Here’s a way for us to acknowledge these individuals as unique human beings.” (Fast Company)

“I was talking to [a] girl we made a voice for. She told me that people are finally seeing her for who she really is.” (Medieros)

These are just a few examples from a larger discourse that reinforces the connection between voice and humanity. VocaliD’s repeated claims that their unique vocal identities humanize individuals imply that one is not fully human unless one’s voice sounds human. This rhetoric positions voiceless individuals as less than human (at least until they pay for a customized human-sounding voice). 

VocaliD’s conflation of voice and humanity makes me wonder about the meaning of “human” in this context. For example, notions of humanity have been historically associated with Western whiteness—and deployed as a means of separating or distinguishing white people from Others—as Alexander Weheliye points out. Though VocaliD’s mission is to diversify manufactured voices, is a “human-sounding” voice still construed as a white voice? Does sounding human mean sounding white? Even if there is a bank of sonically diverse voices to choose from, does racial bias show up in the pacing, phrasing, or inflection caused by the vocal technology? 

Photocredit: iphonedigital @Flickr CC BY-SA

I am also disturbed by the rhetoric of humanity and individuality used by VocaliD because the company adopts the same rhetoric to describe the AI voices they sell to brands for media and smart products. Here’s an example of this rhetoric from the VocaliD AI website: “When you need a voice that resonates, evokes audience empathy, and sounds like you, rather than your competitors, VocaliD’s AI-powered vocal persona is the solution. Your voice — always on, where you need it when you need it.” Using similar rhetorical strategies to describe both voiceless people and products is dehumanizing. And yet, having a more diverse AI vocal mediascape, especially in terms of race, is crucially important since voice-activated machines and products are designed largely by white men who end up reinforcing the sonic color line.

Interestingly, the processes VocaliD uses to create a custom voice reveal that these voices are not, in fact, unique markers of humanity or individuality. It’s hard to find a detailed account of how VocaliD voices are made due to the company’s patents, but here are the basics: VocaliD does not transfer a donated voice directly to a voiceless person’s assistive technology. VocaliD technicians instead blend and digitally manipulate the donated voice with recordings of the noises a voiceless person can make (a laugh, a hum) to create a distinct new voice for the recipient. In other words, donated voices are skillful remixes that wouldn’t be possible without extracting vocal data and manipulating it with digital tools. Despite perpetuating narratives about voice, humanity, and authenticity, VocaliD’s creative blending of vocal material reveals that donated voices are the result of compositional processes that involve much more than people.

Further, considering VocaliD voices from a material rather than human-centric perspective amplifies something important about voices in general. All voices are composed of and grounded in an ecology. That is, voices emerge and are developed through a mixture of: (1) biological makeup (or technological makeup in the case of machines with voices); (2) specific environments and contexts (geography may determine the kind of accents humans have; AI voices have distinct sounds for their brands); (3) technologies (phones, computers, digital recorders and editors, software, and assistive technologies preserve, circulate, and amplify voices); and (4) others (humans often emulate the vocal patterns of the people they interact with most; many machine voices also sound like other machine voices). Put simply, all voices are intentionally and unintentionally composed over time—shaped by ever-changing bodily (and/or technological) states and engagements with the world. Voices are dynamic compositions by nature. Examining voice from a material standpoint shows that voices are not static markers of humanity; voices are responsive and malleable because they are the result of a complex ecology that involves much more than a “unique” human being. 

However, focusing solely on the material aspects of vocality leaves out people’s lived experiences of voice. And based on online videos of VocaliD recipients—like Delaney, a seventeen-year-old with cerebral palsy—VocaliD voices seem to live up to the company’s hype. Delaney appears delighted by her new voice, stating: “I was so excited to get my own voice. I used to have a computer voice and now I sound like a girl. I like that. And I talk more.” Delaney’s teachers also discuss how her new voice completely changed her demeanor. Whereas before Delaney was reluctant to use her assistive technology to speak, her new voice gives her confidence and a stronger sense of identity. As her teacher explains in the video, “she is really engaged in groups, she wants to share her answers, she’s excited to talk with friends. It’s been really nice to see.” For Delaney, a VocaliD voice represents a newfound sense of agency. 

It’s important to recognize this video is not necessarily representative of every VocaliD recipient’s experience, or even Delaney’s full experience. As Meryl Alper notes in Giving Voice, these types of news stories “portray technology as allowing individuals to ‘overcome’ their disability as an individual limitation, and are intended to be uplifting and inspirational for able-bodied audiences” (27). While we should be wary of the technological determinism in the video, observing Delaney use her VocaliD voice—and listening to the emotional responses of her mom and teachers—makes it difficult to deny that donated voices make a positive impact. For me, this video also gets at a larger truth about humans and voice: the ways we hear and understand our own voices, and the ways others interpret the sounds of our voices, matter a great deal. Voices are integral to our identities—to the ways we understand and think about ourselves and others—and the sounds of our voices have social and material consequences, as the SO! Gendered Voices Forum illustrates so clearly. 

An image VocaliD used to advertise themselves on Twitter. Image used for purposes of critique.

It’s worth repeating that VocaliD’s mission to diversify synthetic voices is incredibly important, especially given the restrictive vocal options available to voiceless individuals. It’s also necessary to acknowledge the company has limitations that end up reproducing the structural inequities it tries to address. As Alper observes, “In order to become a speech donor, one must have three to four hours of spare time to record their speech, access to a steady and strong Internet connection, and a quiet location in which to record” (162-63). With these obstacles to donating one’s voice in mind, it’s not surprising that all the VocaliD recipient videos I could find feature white people. Donating one’s voice is much easier for middle to upper class white people who have access to privacy, Internet, and leisure time.

This brief examination of VocaliD raises questions about what a more equitable future for vocal technologies might look/sound like. Though I don’t have the answer, I believe that to understand the fullness of voice, we can’t look at it from a single perspective. We need to account for the entire vocal ecology: the material (biological, technological, financial, etc.) conditions from which a voice emerges or is performed, and individual speakers’ understanding of their culture, race, ethnicity, gender, class, ability, sexuality, etc. An ecological approach to voice involves collaborating with people and their vocal needs and desires—something VocaliD models already. But it also involves accounting for material realities: How might we make the barriers preventing a more diverse voice ecosystem less difficult to navigate—especially for underrepresented groups? In short, we must treat voice holistically. Voices are more than people, more than technologies, more than contexts, more than sounds. Understanding voice means acknowledging the interconnectedness of these things and how that interconnectedness enables or precludes vocal possibilities. 

Featured image: 366-350 You can’t shut me up, Jennifer Moo, CC BY-ND

Steph Ceraso is an associate professor of digital writing and rhetoric at the University of Virginia. Her 2018 book, Sounding Composition: Multimodal Pedagogies for Embodied Listening, proposes an expansive approach to teaching with sound in the composition classroom. She also published a digital book in 2019 called Sound Never Tasted So Good: ‘Teaching’ Sensory Rhetorics—an exploration of writing, sound, rhetoric, and food. She is currently working on a book project that examines sonic forms of invention in various contexts.

tape-reel

REWIND! . . .If you liked this post, you may also dig:

What is a Voice?–Alexis Deighton MacIntyre

Mr. and Mrs. Talking Machine: The Euphonia, the Phonograph, and the Gendering of Nineteenth Century Mechanical Speech – J. Martin Vest

Only the Sound Itself?: Early Radio, Education, and Archives of “No-Sound”–Amanda Keeler

Voice as Ecology: Voice Donation, Materiality, Identity

I first heard about voice donation while listening to “Being Siri,” an experimental audio piece about Erin Anderson donating her voice to Boston-based voice donation company, VocaliD. Like a digital blood bank of sorts, VocaliD provides a platform for donating one’s voice via digital audio recordings. These recordings are used to help technicians create a custom digital voice for a voiceless individual, providing an alternative to the predominately white, male, mechanical-sounding assistive technologies used by people who cannot vocalize for themselves (think Stephen Hawking). VocaliD manufactures voices that better match a person’s race, gender, ethnicity, age, and unique personality. To me, VocaliD encapsulates the promise, complexity, and problematic nature of our current speech AI landscape and serves as an example of why we need to think critically about sound technologies, even when they appear to be wholly beneficial.  

Given the extreme lack of sonic diversity in vocal assistive technologies, VocaliD provides a critically important service. But a closer look at both the rhetoric used by the organization and the material process involved in voice donation also amplifies the limits of overly simplistic, human-centric conceptions of voice. For instance, VocaliD rhetorically frames their service by persistently linking voice to humanity—to self, authenticity, individuality. Consider the following statements made by Rupal Patel, CEO and founder of VocaliD, in which she emphasizes the need for voice donation technology: 

“Here’s a way for us to acknowledge these individuals as unique human beings.” (Fast Company)

“I was talking to [a] girl we made a voice for. She told me that people are finally seeing her for who she really is.” (Medieros)

These are just a few examples from a larger discourse that reinforces the connection between voice and humanity. VocaliD’s repeated claims that their unique vocal identities humanize individuals imply that one is not fully human unless one’s voice sounds human. This rhetoric positions voiceless individuals as less than human (at least until they pay for a customized human-sounding voice). 

VocaliD’s conflation of voice and humanity makes me wonder about the meaning of “human” in this context. For example, notions of humanity have been historically associated with Western whiteness—and deployed as a means of separating or distinguishing white people from Others—as Alexander Weheliye points out. Though VocaliD’s mission is to diversify manufactured voices, is a “human-sounding” voice still construed as a white voice? Does sounding human mean sounding white? Even if there is a bank of sonically diverse voices to choose from, does racial bias show up in the pacing, phrasing, or inflection caused by the vocal technology? 

Photocredit: iphonedigital @Flickr CC BY-SA

I am also disturbed by the rhetoric of humanity and individuality used by VocaliD because the company adopts the same rhetoric to describe the AI voices they sell to brands for media and smart products. Here’s an example of this rhetoric from the VocaliD AI website: “When you need a voice that resonates, evokes audience empathy, and sounds like you, rather than your competitors, VocaliD’s AI-powered vocal persona is the solution. Your voice — always on, where you need it when you need it.” Using similar rhetorical strategies to describe both voiceless people and products is dehumanizing. And yet, having a more diverse AI vocal mediascape, especially in terms of race, is crucially important since voice-activated machines and products are designed largely by white men who end up reinforcing the sonic color line.

Interestingly, the processes VocaliD uses to create a custom voice reveal that these voices are not, in fact, unique markers of humanity or individuality. It’s hard to find a detailed account of how VocaliD voices are made due to the company’s patents, but here are the basics: VocaliD does not transfer a donated voice directly to a voiceless person’s assistive technology. VocaliD technicians instead blend and digitally manipulate the donated voice with recordings of the noises a voiceless person can make (a laugh, a hum) to create a distinct new voice for the recipient. In other words, donated voices are skillful remixes that wouldn’t be possible without extracting vocal data and manipulating it with digital tools. Despite perpetuating narratives about voice, humanity, and authenticity, VocaliD’s creative blending of vocal material reveals that donated voices are the result of compositional processes that involve much more than people.

Further, considering VocaliD voices from a material rather than human-centric perspective amplifies something important about voices in general. All voices are composed of and grounded in an ecology. That is, voices emerge and are developed through a mixture of: (1) biological makeup (or technological makeup in the case of machines with voices); (2) specific environments and contexts (geography may determine the kind of accents humans have; AI voices have distinct sounds for their brands); (3) technologies (phones, computers, digital recorders and editors, software, and assistive technologies preserve, circulate, and amplify voices); and (4) others (humans often emulate the vocal patterns of the people they interact with most; many machine voices also sound like other machine voices). Put simply, all voices are intentionally and unintentionally composed over time—shaped by ever-changing bodily (and/or technological) states and engagements with the world. Voices are dynamic compositions by nature. Examining voice from a material standpoint shows that voices are not static markers of humanity; voices are responsive and malleable because they are the result of a complex ecology that involves much more than a “unique” human being. 

However, focusing solely on the material aspects of vocality leaves out people’s lived experiences of voice. And based on online videos of VocaliD recipients—like Delaney, a seventeen-year-old with cerebral palsy—VocaliD voices seem to live up to the company’s hype. Delaney appears delighted by her new voice, stating: “I was so excited to get my own voice. I used to have a computer voice and now I sound like a girl. I like that. And I talk more.” Delaney’s teachers also discuss how her new voice completely changed her demeanor. Whereas before Delaney was reluctant to use her assistive technology to speak, her new voice gives her confidence and a stronger sense of identity. As her teacher explains in the video, “she is really engaged in groups, she wants to share her answers, she’s excited to talk with friends. It’s been really nice to see.” For Delaney, a VocaliD voice represents a newfound sense of agency. 

It’s important to recognize this video is not necessarily representative of every VocaliD recipient’s experience, or even Delaney’s full experience. As Meryl Alper notes in Giving Voice, these types of news stories “portray technology as allowing individuals to ‘overcome’ their disability as an individual limitation, and are intended to be uplifting and inspirational for able-bodied audiences” (27). While we should be wary of the technological determinism in the video, observing Delaney use her VocaliD voice—and listening to the emotional responses of her mom and teachers—makes it difficult to deny that donated voices make a positive impact. For me, this video also gets at a larger truth about humans and voice: the ways we hear and understand our own voices, and the ways others interpret the sounds of our voices, matter a great deal. Voices are integral to our identities—to the ways we understand and think about ourselves and others—and the sounds of our voices have social and material consequences, as the SO! Gendered Voices Forum illustrates so clearly. 

An image VocaliD used to advertise themselves on Twitter. Image used for purposes of critique.

It’s worth repeating that VocaliD’s mission to diversify synthetic voices is incredibly important, especially given the restrictive vocal options available to voiceless individuals. It’s also necessary to acknowledge the company has limitations that end up reproducing the structural inequities it tries to address. As Alper observes, “In order to become a speech donor, one must have three to four hours of spare time to record their speech, access to a steady and strong Internet connection, and a quiet location in which to record” (162-63). With these obstacles to donating one’s voice in mind, it’s not surprising that all the VocaliD recipient videos I could find feature white people. Donating one’s voice is much easier for middle to upper class white people who have access to privacy, Internet, and leisure time.

This brief examination of VocaliD raises questions about what a more equitable future for vocal technologies might look/sound like. Though I don’t have the answer, I believe that to understand the fullness of voice, we can’t look at it from a single perspective. We need to account for the entire vocal ecology: the material (biological, technological, financial, etc.) conditions from which a voice emerges or is performed, and individual speakers’ understanding of their culture, race, ethnicity, gender, class, ability, sexuality, etc. An ecological approach to voice involves collaborating with people and their vocal needs and desires—something VocaliD models already. But it also involves accounting for material realities: How might we make the barriers preventing a more diverse voice ecosystem less difficult to navigate—especially for underrepresented groups? In short, we must treat voice holistically. Voices are more than people, more than technologies, more than contexts, more than sounds. Understanding voice means acknowledging the interconnectedness of these things and how that interconnectedness enables or precludes vocal possibilities. 

Featured image: 366-350 You can’t shut me up, Jennifer Moo, CC BY-ND

Steph Ceraso is an associate professor of digital writing and rhetoric at the University of Virginia. Her 2018 book, Sounding Composition: Multimodal Pedagogies for Embodied Listening, proposes an expansive approach to teaching with sound in the composition classroom. She also published a digital book in 2019 called Sound Never Tasted So Good: ‘Teaching’ Sensory Rhetorics—an exploration of writing, sound, rhetoric, and food. She is currently working on a book project that examines sonic forms of invention in various contexts.

tape-reel

REWIND! . . .If you liked this post, you may also dig:

What is a Voice?–Alexis Deighton MacIntyre

Mr. and Mrs. Talking Machine: The Euphonia, the Phonograph, and the Gendering of Nineteenth Century Mechanical Speech – J. Martin Vest

Only the Sound Itself?: Early Radio, Education, and Archives of “No-Sound”–Amanda Keeler

SO! Amplifies: Immigrants Wake America Podcast and the Work of Engaged Digital Humanities

SO! Amplifies. . .a highly-curated, rolling mini-post series by which we editors hip you to cultural makers and organizations doing work we really really dig.  You’re welcome!

Conceptualized at a time of rampant increase in anti-immigrant violence, Immigrants Wake America is a creative response to the growing bias and violence against immigrant women in the U.S., as seen in the Atlanta shootings, the rise in hate crimes since the onset of Covid-19, and the US-Mexico border crisis. We believe that storytelling allows us to find similarities and differences between ourselves and others, offering a humanizing counterpart to harmful media narratives. The podcast creates a living archive of stories not yet heard, serving as an audio intervention into how immigrant women’s (hi)stories are narrated and passed on.

Tenement Museum in New York’s Lower East Side. Image by Flickr User Cliff Dix (CC BY-NC-ND 2.0)

Immigrants Wake America is a public humanities, community-engaged project of digital storytelling through podcasts, in partnership with the Tenement Museum in New York. It features storytellers who share their family stories about migration and the centrality of women in their life histories. These storytellers have submitted stories to the Tenement Museum’s digital archive Your Story, Our Story (YSOS),

Founded in 1988, the Tenement Museum, focuses on immigration and immigrants to “foster a society that embraces and values the role of immigration in the evolving American identity.” YSOS cofounded by Annie Polland and Kathryn Lloyd, is a digital archive that houses stories associated with immigration, migration, and cultural identity. Some of the storytellers are first generation immigrants, while others are descendants of immigrants, born and raised in the US; their great-grandparents or grandparents migrated to the US ages ago. Through YSOS, the Tenement Museum invites people across the country to share their stories in the online digital storytelling exhibit. Each story reveals one individual’s experience. Together, the stories help us see how the unique histories shape the nation, and the patterns that bind us together.

screencap of Your Story Our Story homepage

Through exploring and curating stories from Your Story Our Story, we facilitate conversations that supplement and expand it. This makes possible the conception of an archive that is both dynamic and collaborative. Such an archive resists the colonization and appropriation of lives and narratives of our storytellers. We navigate through the ethical conundrums that one might structurally and personally face in this collaborative endeavor. In our engagement with the archives at the Tenement Museum, we believe that our podcasting project really opens up the possibilities for an expansion of the archive.

We released our first episode, the Introductory Episode on January 15th, 2022, and have since been consistently releasing one episode per month.

While our podcast does not claim to retrieve or lay out these microhistories in their entirety, at an early stage of its development, we came to realize the potential that the form of the podcast itself offers for a different kind of storytelling. In our podcast, we treat stories as primary documents instead of marginalia. Michelle Caswell (2014) uses the term “symbolic annihilation” to describe the absence or misrepresentation of marginalized communities in archives. She advocates the powerful forces of community archives in countering “symbolic annihilation.” In thinking about archives in The Archaeology of Knowledge, Michel Foucault is concerned with “the density of discursive practices” wherein he observes “systems that establish statements as events and things (145)” This system of statements (as events or things) is what contributes to the law of what can be said. Processes of digital communal archiving such as those done by South Asian American Digital Archive (SAADA) or the Tenement Museum attempt to extend or expand the systematic possibility of events and things. Caswell and her colleagues have demonstrated the importance and success of the SAADA project. They have also pointed to the impossibility of representation in a traditional archive which is built on violence committed on colonized and enslaved bodies, also eloquently pointed out by Saidiya Hartman’s scholarship.

Through our experience we’ve learnt that podcasts can serve as a transgressive-dynamic expansion of digital archiving, given their unique ability to cut across racial and gendered lines of preconceived sonic notions and their potential to expand the current techniques and media of digital archiving. We map this formal potential of the podcast in the way it intersects with digital archiving in the following ways:

First, narratorial voice.

We wanted our project to act as an intervention in the way in which immigrant women’s (hi)stories are consumed and passed on. We wanted to provide counter narratives. It was essential that the storytellers share their stories in their own voices, literally! The audio medium allows us to produce a space for listening to voices that are otherwise marginalized and/or demonized. 

–Le Li and Shruti Jain

Among the several unique and inspiring stories of resilience that the Tenement Museum houses, one such is a story by an immigrant case manager at the American Civic Association in Binghamton, Goretti Mugambwa. The museum and our podcast make it possible for her story to be narrated by herself in her voice. With her experience of working with the refugee and immigrant community she also does not just remain an individual voice, but acts to further a collective assertion.

Next, sonic variations.

Our storytellers’ voices are not just “characteristics” of the story but are an essential part of the story itself. We believe that each immigrant and their descendent brings to the story their unique tonal texture. This diversity destabilizes what immigrants and their descendants are expected to sound like. The sounds we add in the editing process are minimal. We try not to impose emotional cues and responses upon our listeners. 

–Shruti Jain and Le Li

The multiplicity of voices in our podcast–and therefore in the archive–are not just “characteristics” of the aural storytelling or listening process, but are as much an essential part of the story itself. In line with what The Sonic Color Line reminds us, our work also finds that, “sound frequently appears to be visuality’s doppelgänger in U.S. racial history” (Stoever 4). This leads to the coding of race as not just visual but aural too. We want to clarify that the white constructed ideas of how people of color must sound flatten out the complexities in how people within and across communities do sound. At the same time, these notions of white sonic normativity also create a strong sense of what one must or must not sound like in order to succeed in the racial capitalist world order. The storytellers of our podcast and we ourselves are of diverse backgrounds. This, for us, is a way to demonstrate the “complex range of sounds actually produced by people of color” (Stoever 43). As Nancy Morales argues in “Óyeme Voz: U.S. Latin@ & Immigrant Communities Re-Sound Citizenship and Belonging,” the sound of ‘everyday voices’ mobilized against—and remarking on—the nation-state’s attempts to mark immigrant communities as vulnerable exerts an impactful and profoundly material agency.” With its conversational and collaborative format, our podcast serves as a dynamic medium to represent (his)stories that complicate generic conventions in critical ways.

Then, collaboration.

We have also been personally deeply impacted by the process of working on this podcast. We have made lasting bonds with our colleagues and storytellers alike. The storytellers of our podcast act not just as guests, but as collaborators and stakeholders. Instead of interpreting the stories in our own way and retelling the stories, we collaborate with the storytellers, and facilitate the unfolding of hidden stories by the storytellers. Dr. Lisa Yun, Professor of English at Binghamton University, and Kathryn Lloyd, Senior Director of Programs, Tenement Museum, have been advisors and the executive producers of the podcast. Together with Lloyd and Yun, we built a project on the ethos of collaboration.

The editing process of IWA too, is different. Rather than making individual editorial decisions, we engage the storytellers directly in post-production. After finishing a first edit of an episode collaboratively between ourselves, we then send it to the storytellers for their feedback and approval before releasing it. Sometimes, the storytellers do suggest changes. Based on their feedback, we re-edit the episode and eventually release it after the storytellers approval. We have also innovated methods of community editing, where we edit in groups of as large as 15 people.

Finally, accessibility.

The podcast medium makes Immigrants Wake America an ideal project for the public humanities. As opposed to lengthier podcasts, each episode of our podcast is edited down to 15-20 minutes. These can be used by educators as an in-class resource to generate discussion and activities. Community listeners could tune in during lunch breaks, get-togethers, cooking, driving or doing chores. Our episodes can also serve as conversation starters and help facilitate affective bonds among immigrants and non-immigrants alike.

The final episode of our first season, “Finding Our Grandmother in the Records,” aired just last week, and a second season is in the works.

As a way to expand this project, our second season will feature storytellers from our local community in addition to Your Story, Our Story. We plan to have units within our project dedicated to translation, recording and editing, and creating teaching resources. We aim for meaningful and engaged conversations and try to blur the supposed boundaries between the university and the community. Join us!

The first season of Immigrants Wake America was sponsored through the Institute for Advanced Studies in the Humanities at Binghamton University and a Public Humanities Grant from Humanities New York. Dr. Lisa Yun, Professor of English at Binghamton University, and Kathryn Lloyd, Senior Director of Programs, Tenement Museum, have been our advisors and the executive producers of the podcast. IWA is available on major streaming platforms such as Spotify, Google Podcasts, Apple Podcasts, Amazon Music, Soundcloud, and Audible.

Le Li and Shruti Jain are pursuing their PhDs at Binghamton University in the Translation Research and Instruction Program and the English Department respectively. They were Humanities New York Public Humanities fellows (2021-22) and graduate fellows of the Institute for Advanced Studies in the Humanities (IASH) at Binghamton University (2021-22). Through their podcast project and their work with digital community archives, Le and Shruti are currently working on exploring intersection between podcasts and digital archiving. They try to capitalize on the unique ability that the form of the podcast offers to cut across racial and gendered lines of preconceived sonic notions, which makes possible the conception of an archive that can be both dynamic and collaborative. Le’s research interests include translation studies, cultural studies, diaspora studies, and public humanities. Shruti’s PhD focuses on the Enlightenment, British Empire and the relationalities between race and caste formations. 

REWIND!…If you liked this post, you may also dig all this good stuff about sound studies pedagogy! Good luck with Fall semester, folks!:

“Heavy Airplay, All Day with No Chorus”: Classroom Sonic Consciousness in the Playlist ProjectTodd Craig

SO! Podcast #79: Behind the Podcast: deconstructing scenes from AFRI0550, African American Health Activism – Nic John Ramos and Laura Garbes

The Sounds of Anti-Anti-Essentialism: Listening to Black Consciousness in the Classroom- Carter Mathes

Making His Story Their Story: Teaching Hamilton at a Minority-serving Institution–Erika Gisela Abad

Teaching Soundwalks in a Course on Gentrification, Black Music, and Corporate America–Rami Toubia Stucky

Deejaying her Listening: Learning through Life Stories of Human Rights Violations– Emmanuelle Sonntag and Bronwen Low

Audio Culture Studies: Scaffolding a Sequence of Assignments– Jentery Sayers

Deep Listening as Philogynoir: Playlists, Black Girl Idiom, and Love–Shakira Holt

“Toward A Civically Engaged Sound Studies, or ReSounding Binghamton”–Jennifer Lynn Stoever

Listening to #Occupy in the Classroom–D. Travers Scott

SO! Podcast #71: Everyday Sounds of Resilience and Being: Black Joy at School–Walter Gershon

Sounding Out! Podcast #13: Sounding Shakespeare in S(e)oul– Brooke Carlson

A Listening Mind: Sound Learning in a Literature Classroom–Nicole Brittingham Furlonge

My Voice, or On Not Staying Quiet–Kaitlyn Liu

(Re)Locating Soundscapes of Schooling: Learning to Listen to Children’s Lifeworlds–Cassie J. Brownell

If You Can Hear My Voice: A Beginner’s Guide to Teaching–Caroline Pinkston

Mukbang Cooks, Chews, and Heals – David Lee

SO! Podcast #80: Refugee Realities Miniseries–Steph Ceraso

SO! Amplifies: Immigrants Wake America Podcast and the Work of Engaged Digital Humanities

SO! Amplifies. . .a highly-curated, rolling mini-post series by which we editors hip you to cultural makers and organizations doing work we really really dig.  You’re welcome!

Conceptualized at a time of rampant increase in anti-immigrant violence, Immigrants Wake America is a creative response to the growing bias and violence against immigrant women in the U.S., as seen in the Atlanta shootings, the rise in hate crimes since the onset of Covid-19, and the US-Mexico border crisis. We believe that storytelling allows us to find similarities and differences between ourselves and others, offering a humanizing counterpart to harmful media narratives. The podcast creates a living archive of stories not yet heard, serving as an audio intervention into how immigrant women’s (hi)stories are narrated and passed on.

Tenement Museum in New York’s Lower East Side. Image by Flickr User Cliff Dix (CC BY-NC-ND 2.0)

Immigrants Wake America is a public humanities, community-engaged project of digital storytelling through podcasts, in partnership with the Tenement Museum in New York. It features storytellers who share their family stories about migration and the centrality of women in their life histories. These storytellers have submitted stories to the Tenement Museum’s digital archive Your Story, Our Story (YSOS),

Founded in 1988, the Tenement Museum, focuses on immigration and immigrants to “foster a society that embraces and values the role of immigration in the evolving American identity.” YSOS cofounded by Annie Polland and Kathryn Lloyd, is a digital archive that houses stories associated with immigration, migration, and cultural identity. Some of the storytellers are first generation immigrants, while others are descendants of immigrants, born and raised in the US; their great-grandparents or grandparents migrated to the US ages ago. Through YSOS, the Tenement Museum invites people across the country to share their stories in the online digital storytelling exhibit. Each story reveals one individual’s experience. Together, the stories help us see how the unique histories shape the nation, and the patterns that bind us together.

screencap of Your Story Our Story homepage

Through exploring and curating stories from Your Story Our Story, we facilitate conversations that supplement and expand it. This makes possible the conception of an archive that is both dynamic and collaborative. Such an archive resists the colonization and appropriation of lives and narratives of our storytellers. We navigate through the ethical conundrums that one might structurally and personally face in this collaborative endeavor. In our engagement with the archives at the Tenement Museum, we believe that our podcasting project really opens up the possibilities for an expansion of the archive.

We released our first episode, the Introductory Episode on January 15th, 2022, and have since been consistently releasing one episode per month.

While our podcast does not claim to retrieve or lay out these microhistories in their entirety, at an early stage of its development, we came to realize the potential that the form of the podcast itself offers for a different kind of storytelling. In our podcast, we treat stories as primary documents instead of marginalia. Michelle Caswell (2014) uses the term “symbolic annihilation” to describe the absence or misrepresentation of marginalized communities in archives. She advocates the powerful forces of community archives in countering “symbolic annihilation.” In thinking about archives in The Archaeology of Knowledge, Michel Foucault is concerned with “the density of discursive practices” wherein he observes “systems that establish statements as events and things (145)” This system of statements (as events or things) is what contributes to the law of what can be said. Processes of digital communal archiving such as those done by South Asian American Digital Archive (SAADA) or the Tenement Museum attempt to extend or expand the systematic possibility of events and things. Caswell and her colleagues have demonstrated the importance and success of the SAADA project. They have also pointed to the impossibility of representation in a traditional archive which is built on violence committed on colonized and enslaved bodies, also eloquently pointed out by Saidiya Hartman’s scholarship.

Through our experience we’ve learnt that podcasts can serve as a transgressive-dynamic expansion of digital archiving, given their unique ability to cut across racial and gendered lines of preconceived sonic notions and their potential to expand the current techniques and media of digital archiving. We map this formal potential of the podcast in the way it intersects with digital archiving in the following ways:

First, narratorial voice.

We wanted our project to act as an intervention in the way in which immigrant women’s (hi)stories are consumed and passed on. We wanted to provide counter narratives. It was essential that the storytellers share their stories in their own voices, literally! The audio medium allows us to produce a space for listening to voices that are otherwise marginalized and/or demonized. 

–Le Li and Shruti Jain

Among the several unique and inspiring stories of resilience that the Tenement Museum houses, one such is a story by an immigrant case manager at the American Civic Association in Binghamton, Goretti Mugambwa. The museum and our podcast make it possible for her story to be narrated by herself in her voice. With her experience of working with the refugee and immigrant community she also does not just remain an individual voice, but acts to further a collective assertion.

Next, sonic variations.

Our storytellers’ voices are not just “characteristics” of the story but are an essential part of the story itself. We believe that each immigrant and their descendent brings to the story their unique tonal texture. This diversity destabilizes what immigrants and their descendants are expected to sound like. The sounds we add in the editing process are minimal. We try not to impose emotional cues and responses upon our listeners. 

–Shruti Jain and Le Li

The multiplicity of voices in our podcast–and therefore in the archive–are not just “characteristics” of the aural storytelling or listening process, but are as much an essential part of the story itself. In line with what The Sonic Color Line reminds us, our work also finds that, “sound frequently appears to be visuality’s doppelgänger in U.S. racial history” (Stoever 4). This leads to the coding of race as not just visual but aural too. We want to clarify that the white constructed ideas of how people of color must sound flatten out the complexities in how people within and across communities do sound. At the same time, these notions of white sonic normativity also create a strong sense of what one must or must not sound like in order to succeed in the racial capitalist world order. The storytellers of our podcast and we ourselves are of diverse backgrounds. This, for us, is a way to demonstrate the “complex range of sounds actually produced by people of color” (Stoever 43). As Nancy Morales argues in “Óyeme Voz: U.S. Latin@ & Immigrant Communities Re-Sound Citizenship and Belonging,” the sound of ‘everyday voices’ mobilized against—and remarking on—the nation-state’s attempts to mark immigrant communities as vulnerable exerts an impactful and profoundly material agency.” With its conversational and collaborative format, our podcast serves as a dynamic medium to represent (his)stories that complicate generic conventions in critical ways.

Then, collaboration.

We have also been personally deeply impacted by the process of working on this podcast. We have made lasting bonds with our colleagues and storytellers alike. The storytellers of our podcast act not just as guests, but as collaborators and stakeholders. Instead of interpreting the stories in our own way and retelling the stories, we collaborate with the storytellers, and facilitate the unfolding of hidden stories by the storytellers. Dr. Lisa Yun, Professor of English at Binghamton University, and Kathryn Lloyd, Senior Director of Programs, Tenement Museum, have been advisors and the executive producers of the podcast. Together with Lloyd and Yun, we built a project on the ethos of collaboration.

The editing process of IWA too, is different. Rather than making individual editorial decisions, we engage the storytellers directly in post-production. After finishing a first edit of an episode collaboratively between ourselves, we then send it to the storytellers for their feedback and approval before releasing it. Sometimes, the storytellers do suggest changes. Based on their feedback, we re-edit the episode and eventually release it after the storytellers approval. We have also innovated methods of community editing, where we edit in groups of as large as 15 people.

Finally, accessibility.

The podcast medium makes Immigrants Wake America an ideal project for the public humanities. As opposed to lengthier podcasts, each episode of our podcast is edited down to 15-20 minutes. These can be used by educators as an in-class resource to generate discussion and activities. Community listeners could tune in during lunch breaks, get-togethers, cooking, driving or doing chores. Our episodes can also serve as conversation starters and help facilitate affective bonds among immigrants and non-immigrants alike.

The final episode of our first season, “Finding Our Grandmother in the Records,” aired just last week, and a second season is in the works.

As a way to expand this project, our second season will feature storytellers from our local community in addition to Your Story, Our Story. We plan to have units within our project dedicated to translation, recording and editing, and creating teaching resources. We aim for meaningful and engaged conversations and try to blur the supposed boundaries between the university and the community. Join us!

The first season of Immigrants Wake America was sponsored through the Institute for Advanced Studies in the Humanities at Binghamton University and a Public Humanities Grant from Humanities New York. Dr. Lisa Yun, Professor of English at Binghamton University, and Kathryn Lloyd, Senior Director of Programs, Tenement Museum, have been our advisors and the executive producers of the podcast. IWA is available on major streaming platforms such as Spotify, Google Podcasts, Apple Podcasts, Amazon Music, Soundcloud, and Audible.

Le Li and Shruti Jain are pursuing their PhDs at Binghamton University in the Translation Research and Instruction Program and the English Department respectively. They were Humanities New York Public Humanities fellows (2021-22) and graduate fellows of the Institute for Advanced Studies in the Humanities (IASH) at Binghamton University (2021-22). Through their podcast project and their work with digital community archives, Le and Shruti are currently working on exploring intersection between podcasts and digital archiving. They try to capitalize on the unique ability that the form of the podcast offers to cut across racial and gendered lines of preconceived sonic notions, which makes possible the conception of an archive that can be both dynamic and collaborative. Le’s research interests include translation studies, cultural studies, diaspora studies, and public humanities. Shruti’s PhD focuses on the Enlightenment, British Empire and the relationalities between race and caste formations. 

REWIND!…If you liked this post, you may also dig all this good stuff about sound studies pedagogy! Good luck with Fall semester, folks!:

“Heavy Airplay, All Day with No Chorus”: Classroom Sonic Consciousness in the Playlist ProjectTodd Craig

SO! Podcast #79: Behind the Podcast: deconstructing scenes from AFRI0550, African American Health Activism – Nic John Ramos and Laura Garbes

The Sounds of Anti-Anti-Essentialism: Listening to Black Consciousness in the Classroom- Carter Mathes

Making His Story Their Story: Teaching Hamilton at a Minority-serving Institution–Erika Gisela Abad

Teaching Soundwalks in a Course on Gentrification, Black Music, and Corporate America–Rami Toubia Stucky

Deejaying her Listening: Learning through Life Stories of Human Rights Violations– Emmanuelle Sonntag and Bronwen Low

Audio Culture Studies: Scaffolding a Sequence of Assignments– Jentery Sayers

Deep Listening as Philogynoir: Playlists, Black Girl Idiom, and Love–Shakira Holt

“Toward A Civically Engaged Sound Studies, or ReSounding Binghamton”–Jennifer Lynn Stoever

Listening to #Occupy in the Classroom–D. Travers Scott

SO! Podcast #71: Everyday Sounds of Resilience and Being: Black Joy at School–Walter Gershon

Sounding Out! Podcast #13: Sounding Shakespeare in S(e)oul– Brooke Carlson

A Listening Mind: Sound Learning in a Literature Classroom–Nicole Brittingham Furlonge

My Voice, or On Not Staying Quiet–Kaitlyn Liu

(Re)Locating Soundscapes of Schooling: Learning to Listen to Children’s Lifeworlds–Cassie J. Brownell

If You Can Hear My Voice: A Beginner’s Guide to Teaching–Caroline Pinkston

Mukbang Cooks, Chews, and Heals – David Lee

SO! Podcast #80: Refugee Realities Miniseries–Steph Ceraso

The Sound of What Becomes Possible: Language Politics and Jesse Chun’s 술래 SULLAE (2020)

“To this day I think about all the strange words I missed out on, all the losses I’m still carrying from faraway…I still think of the time when I spoke one language, and that language was whole.”

Chun 2020

Language can be a site of loss, a wholeness with which one, due to migration, has never really known. In the above passage, artist, Jesse Chun, reflects on how her grandmother spoke words in a language she did not understand, but yearned to hear and feel those sounds after her passing. There is a sonic residue that sticks to diasporic experiences. There are sounds that can stir up a blend of affect and ideation that is comforting when whiteness is unsettling. It is this disjuncture between words, meaning, and their sounds, that drew me to Chun’s work, 술래 SULLAE (2020). This piece reminded me of how sound, in its most ambiguous and queer forms, can hold the contingencies of history, language, memory, family, and the genealogies of loss that mark these sites of colonial dispossession.

술래 SULLAE (2020) is a single channeled video that draws from ganggang sullae, a Korean seasonal harvest and fertility ritual that integrates song and dance and is typically performed by women under the glow of moonlight. The participants hold hands forming a circle that through their movement, expands, disassembles, and changes its form. The songs can be both impromptu or pre-determined and encourages the participants to express their feelings in chorus with one another.

Video made by the Cultural Heritage Administration of the Republic of Korea (2008) for UNESCO “intangible heritage” application

Diana Seo Hyung Lee (2020) suggests that historically ganggang sullae was meant to provide a forum for its participants to express emotions connected to living within patriarchal systems of power and oppression. She writes: “the women participating would not have been able to, in their everyday lives, sing, speak loudly, nor leave the house at night, in the patriarchal society of ancient Korea. This dance was a license for their one release.” In 술래 SULLAE, the dance proves to be a defiant presence. The women flash on screen as an unbreakable chain reinscribing a gendered history with new sounds and images that gesture to emancipatory possibilities.

술래 SULLAE combines archival clips of ganggang sullae, index pages from intonation books, images of Hangul and English consonants and audio splices from YouTube tutorials on how to pronounce English correctly. In the video, language becomes unhinged from expectation but at the same time, given form through history. The sound of the English language is disembodied and spliced into phonemic pulses.  In 술래 SULLAE, Chun has created an encounter with the grammars of polyphony; a simultaneity of sounds that are both restrained by and resistant to the imposition of English on the Korean diaspora. Through what Chun has described as a form of “unlanguaging” following Rey Chow, her audience is witness to new meanings produced through the abstraction, manipulation, and redaction of sounds and symbols from the English language.

Still from 술래 SULLAE (2020), courtesy of the artist

Chun’s editing and manipulation of English sounds is intentional. In an interview with Art Forum, Chun shares: “Taking the sound apart but still keeping it within the conceptual framework of English made me think about what else is embedded in making a language. English is tied up with legacies of imperialism; there’s so much unseen violence that is part of how this language is institutionalized.” What remains after the edits is an inventory of sounds that disrupts the primacy of the vowel as central to English word construction and thus, central to colonial imagination.  

Like Chun, I realize that my conceptualizing of language is within an English framework, but my hope is that when we turn to the affective and when we begin to pull language a part, something different, something resistant, is produced.  I am neither an expert in English nor Korean linguistics, it was the sounds in this work that pulled me into it. In thinking with 술래 SULLAE, I’m interested in what becomes possible in the absence of the vowel. I turn to the interruptive potential of consonant sounds to affect and incite methods of communication outside of those steeped in colonial dominance. What does it mean to de-emphasize the function of vowel sounds in language and reorient our listening to the consonant? What do consonant sounds teach us about the sonics of race that underwrite hierarchies of language? What methods of communication become possible when we do away with words and are left with only their sonic substance?

술래 SULLAE 3-channel excerpt

Through her assemblage of consonant sounds in 술래 SULLAE, Chun is making a deliberate choice to describe and animate a politics of language through refusing its colonial enclosures and turning to the aesthetic in order hold the excesses of description. She refuses the vowel in this piece, not by denying its presence, but instead relegating it to the soundless and the unfamiliar, a space of, in her words, “untranslatability.” In this undoing, consonants become the emotive force where new meanings and orientations to the sounds that mark our words are forged.

술래 SULLAE opens with the sound ssshhh; a pairing of consonant sounds that is often associated with insisting on silence, a sound meant to reprimand. Chun extracts and emplaces this sound in a new aesthetic landscape that is independent and unregulated by colonial schemas of enunciation and translation. The prominent soundscapes of the video are consonant sounds and when removed from their phonetic relations to vowels these sounds undo the presumptive structuring or potential reprimand of English. In 술래 SULLAE, we are meant to experience the fullness of the consonants’ timbre…ssshhh, ppp, ddd, tttt, kkkk…these edited clips of sound originally meant to instruct and assimilate speech into English pronunciations now serve a different function. For me, they secure Chun’s political orientation: one that is about the crafting of a world that involves the careful consideration of the logistics, function, and embedded emotions of the sounds that inhabit it.

score (for unlanguaging), 2020
graphite, watermarks, paper, aluminum frame, 13 x 16 inches

All languages contain their own unique set of vowels and consonants, but, Anne Carson reminds us that: “The importance of vowels to human speech has remained. There are words in English without consonants, but so central are vowels to word construction that there isn’t a word in English that doesn’t include a vowel.” In speech, consonants sounds are meant to break up the intended agenda of vowels. The ssshhh, ppp, ddd, tttt, kkkk, are antithetical to the circle or the rounded mouth needed to voice a vowel sound. Unlike the openness of a vowel, producing consonant sounds involves a narrowing of the vocal tract. This narrowing is referred to as constriction or the obstruction of breath whereby sound is produced by a form of corporeal tension. Consonant sounds also demand all the mechanics of the mouth: the lips, the teeth, the tongue, and the palette. Shhhh, requires the corners of the lips to lower and rather than rounding, the lips become pursed, and teeth become exposed. Parts of the mouth are drawn in. The soft palate is raised, and the tongue reaches upwards towards the roof of the mouth without touching it and then the tip of the tongue lowers behind the teeth.

Consonants emerge out of collectivity. Where a vowel is sounded without vocal constraint, consonants require more effort. Their sounds are produced through intricate bodily choreographies in the mouth that involve both constriction and collaboration. Ganggangsullae likewise relies on effort and interdependence. Participants collectively determine the speed and/or shape of their dance. They may even become serpentine or separate into smaller circles depending on what the group decides. The dance also provides an aesthetic space for its participants to voice frustration, anger, and tension through song with the hopes of producing reprieve from gendered hardships. Chun has decided to withhold these songs from her audience; we never hear the women singing. Through this erasure, Chun embeds the consonant sound with affective force whereby a politics of language and gendered presence is enunciated through and beyond a form of silencing. The dance redirects trajectories of dominance whereby the shushing takes on a new voice imbued with agency and hope. Because of how Chun isolates and amplifies its sound, ssshhhh is free to take on different meanings and associations. For me, I was reminded of rushing water or gusts of wind, or the sound used to lull my child to sleep. I was brought into another index of knowing and relating.

술래 SULLAE, 3-channel installation view, 2020, courtesy of the artist

The sounds of language hold erasures and layered histories often obfuscated by our mundane encounters with them. Largely understood as the most sonorous part of the syllable, vowels produce the loudest speech sounds and their capacity for holding larger amplitudes or louder volumes have been linked to the sonic expression of emotion. Consonant sounds are more pragmatic than vowels. They are known for their functionality, for the ways in which they assemble the semantic structure of words and for their capacities to hold vowels in place or as Anne Carson describes as “delineating meaning amid the flow of open vowel sounds.” Consonant and vowel sounds map out different functional trajectories by virtue of the shape of mouth and orientation of breath that these sounds demand. Like Chun, I’m interested in what political orientations become possible when we source emotion elsewhere, beyond the confines of spoken words imposed upon us.

The word consonant is a noun, a word used to identify or classify, a semantic enclosure that establishes a subject or object. But unlike the word vowel, consonant is also an adjective. A consonant possesses the capacity to describe, to name, to tell us more. Adjectives parcel out description on states of being, in this way, they are inherently phenomenological. In 술래 SULLAE, Chun empties vowels of their sonic substance leaving behind traces of fragmented characters and differently shaped circles in their wake. They are stripped of breath and their symbolic value forming a new method of communication that reroutes expectations of what language, as we know it, can do and sound like. Like, ganggang sullae, the vowel is premised on the shape of a circle, but in 술래 SULLAE, Chun provokes us to think about what becomes possible beyond the circular structuring device, what becomes possible beyond the purview of the violent embeddedness of English and its colonial exigencies.

new moons, 2020, graphite on wall, surveillance mirror, 18 x 6 inches each
untitled (ㄷ), 2020, a functional concrete stool, courtesy of artist

Chun has noted that the moon that hovers above the ganggangsullae is yet another site of imperial conquest. In Art Forum, Chun states: “when I look up at it to feel comforted or to find solace, I’m reminded of colonial violence and an agenda that’s projected onto it. In that way, the moon also reflected how I see language.” Chun’s turn to consonants signals a reshaping of the colonial frame that does not disavow or idealize the legacies of imperialism on systems of communication, but instead highlights the tensions and obstructions produced in its shadows.

Featured image: 술래 SULLAE, 2020, single-channel version, courtesy of artist

Casey Mecija is an Assistant Professor in the Department of Communication & Media Studies at York University. Her current research examines sound as a mode of affective, psychic and social representation, specifically in relation to diasporic experience. Drawing on sound studies, queer diaspora studies and Filipinx Studies, her research considers how sensorial encounters are enmeshed and disciplined by social and psychic conditions. She is also a musician and filmmaker, whose work has received a number of accolades and has been presented internationally.

tape-reel

REWIND! . . .If you liked this post, you may also dig:

Blank Space and “Asymmetries of Childhood Innocence”  –Casey Mecija

Re-orienting Sound Studies’ Aural Fixation: Christine Sun Kim’s “Subjective Loudness”–Sarah Mayberry Scott

Tape Hiss, Compression, and the Stubborn Materiality of Sonic Diaspora–Chris Chien

The Sound of What Becomes Possible: Language Politics and Jesse Chun’s 술래 SULLAE (2020)

“To this day I think about all the strange words I missed out on, all the losses I’m still carrying from faraway…I still think of the time when I spoke one language, and that language was whole.”

Chun 2020

Language can be a site of loss, a wholeness with which one, due to migration, has never really known. In the above passage, artist, Jesse Chun, reflects on how her grandmother spoke words in a language she did not understand, but yearned to hear and feel those sounds after her passing. There is a sonic residue that sticks to diasporic experiences. There are sounds that can stir up a blend of affect and ideation that is comforting when whiteness is unsettling. It is this disjuncture between words, meaning, and their sounds, that drew me to Chun’s work, 술래 SULLAE (2020). This piece reminded me of how sound, in its most ambiguous and queer forms, can hold the contingencies of history, language, memory, family, and the genealogies of loss that mark these sites of colonial dispossession.

술래 SULLAE (2020) is a single channeled video that draws from ganggang sullae, a Korean seasonal harvest and fertility ritual that integrates song and dance and is typically performed by women under the glow of moonlight. The participants hold hands forming a circle that through their movement, expands, disassembles, and changes its form. The songs can be both impromptu or pre-determined and encourages the participants to express their feelings in chorus with one another.

Video made by the Cultural Heritage Administration of the Republic of Korea (2008) for UNESCO “intangible heritage” application

Diana Seo Hyung Lee (2020) suggests that historically ganggang sullae was meant to provide a forum for its participants to express emotions connected to living within patriarchal systems of power and oppression. She writes: “the women participating would not have been able to, in their everyday lives, sing, speak loudly, nor leave the house at night, in the patriarchal society of ancient Korea. This dance was a license for their one release.” In 술래 SULLAE, the dance proves to be a defiant presence. The women flash on screen as an unbreakable chain reinscribing a gendered history with new sounds and images that gesture to emancipatory possibilities.

술래 SULLAE combines archival clips of ganggang sullae, index pages from intonation books, images of Hangul and English consonants and audio splices from YouTube tutorials on how to pronounce English correctly. In the video, language becomes unhinged from expectation but at the same time, given form through history. The sound of the English language is disembodied and spliced into phonemic pulses.  In 술래 SULLAE, Chun has created an encounter with the grammars of polyphony; a simultaneity of sounds that are both restrained by and resistant to the imposition of English on the Korean diaspora. Through what Chun has described as a form of “unlanguaging” following Rey Chow, her audience is witness to new meanings produced through the abstraction, manipulation, and redaction of sounds and symbols from the English language.

Still from 술래 SULLAE (2020), courtesy of the artist

Chun’s editing and manipulation of English sounds is intentional. In an interview with Art Forum, Chun shares: “Taking the sound apart but still keeping it within the conceptual framework of English made me think about what else is embedded in making a language. English is tied up with legacies of imperialism; there’s so much unseen violence that is part of how this language is institutionalized.” What remains after the edits is an inventory of sounds that disrupts the primacy of the vowel as central to English word construction and thus, central to colonial imagination.  

Like Chun, I realize that my conceptualizing of language is within an English framework, but my hope is that when we turn to the affective and when we begin to pull language a part, something different, something resistant, is produced.  I am neither an expert in English nor Korean linguistics, it was the sounds in this work that pulled me into it. In thinking with 술래 SULLAE, I’m interested in what becomes possible in the absence of the vowel. I turn to the interruptive potential of consonant sounds to affect and incite methods of communication outside of those steeped in colonial dominance. What does it mean to de-emphasize the function of vowel sounds in language and reorient our listening to the consonant? What do consonant sounds teach us about the sonics of race that underwrite hierarchies of language? What methods of communication become possible when we do away with words and are left with only their sonic substance?

술래 SULLAE 3-channel excerpt

Through her assemblage of consonant sounds in 술래 SULLAE, Chun is making a deliberate choice to describe and animate a politics of language through refusing its colonial enclosures and turning to the aesthetic in order hold the excesses of description. She refuses the vowel in this piece, not by denying its presence, but instead relegating it to the soundless and the unfamiliar, a space of, in her words, “untranslatability.” In this undoing, consonants become the emotive force where new meanings and orientations to the sounds that mark our words are forged.

술래 SULLAE opens with the sound ssshhh; a pairing of consonant sounds that is often associated with insisting on silence, a sound meant to reprimand. Chun extracts and emplaces this sound in a new aesthetic landscape that is independent and unregulated by colonial schemas of enunciation and translation. The prominent soundscapes of the video are consonant sounds and when removed from their phonetic relations to vowels these sounds undo the presumptive structuring or potential reprimand of English. In 술래 SULLAE, we are meant to experience the fullness of the consonants’ timbre…ssshhh, ppp, ddd, tttt, kkkk…these edited clips of sound originally meant to instruct and assimilate speech into English pronunciations now serve a different function. For me, they secure Chun’s political orientation: one that is about the crafting of a world that involves the careful consideration of the logistics, function, and embedded emotions of the sounds that inhabit it.

score (for unlanguaging), 2020
graphite, watermarks, paper, aluminum frame, 13 x 16 inches

All languages contain their own unique set of vowels and consonants, but, Anne Carson reminds us that: “The importance of vowels to human speech has remained. There are words in English without consonants, but so central are vowels to word construction that there isn’t a word in English that doesn’t include a vowel.” In speech, consonants sounds are meant to break up the intended agenda of vowels. The ssshhh, ppp, ddd, tttt, kkkk, are antithetical to the circle or the rounded mouth needed to voice a vowel sound. Unlike the openness of a vowel, producing consonant sounds involves a narrowing of the vocal tract. This narrowing is referred to as constriction or the obstruction of breath whereby sound is produced by a form of corporeal tension. Consonant sounds also demand all the mechanics of the mouth: the lips, the teeth, the tongue, and the palette. Shhhh, requires the corners of the lips to lower and rather than rounding, the lips become pursed, and teeth become exposed. Parts of the mouth are drawn in. The soft palate is raised, and the tongue reaches upwards towards the roof of the mouth without touching it and then the tip of the tongue lowers behind the teeth.

Consonants emerge out of collectivity. Where a vowel is sounded without vocal constraint, consonants require more effort. Their sounds are produced through intricate bodily choreographies in the mouth that involve both constriction and collaboration. Ganggangsullae likewise relies on effort and interdependence. Participants collectively determine the speed and/or shape of their dance. They may even become serpentine or separate into smaller circles depending on what the group decides. The dance also provides an aesthetic space for its participants to voice frustration, anger, and tension through song with the hopes of producing reprieve from gendered hardships. Chun has decided to withhold these songs from her audience; we never hear the women singing. Through this erasure, Chun embeds the consonant sound with affective force whereby a politics of language and gendered presence is enunciated through and beyond a form of silencing. The dance redirects trajectories of dominance whereby the shushing takes on a new voice imbued with agency and hope. Because of how Chun isolates and amplifies its sound, ssshhhh is free to take on different meanings and associations. For me, I was reminded of rushing water or gusts of wind, or the sound used to lull my child to sleep. I was brought into another index of knowing and relating.

술래 SULLAE, 3-channel installation view, 2020, courtesy of the artist

The sounds of language hold erasures and layered histories often obfuscated by our mundane encounters with them. Largely understood as the most sonorous part of the syllable, vowels produce the loudest speech sounds and their capacity for holding larger amplitudes or louder volumes have been linked to the sonic expression of emotion. Consonant sounds are more pragmatic than vowels. They are known for their functionality, for the ways in which they assemble the semantic structure of words and for their capacities to hold vowels in place or as Anne Carson describes as “delineating meaning amid the flow of open vowel sounds.” Consonant and vowel sounds map out different functional trajectories by virtue of the shape of mouth and orientation of breath that these sounds demand. Like Chun, I’m interested in what political orientations become possible when we source emotion elsewhere, beyond the confines of spoken words imposed upon us.

The word consonant is a noun, a word used to identify or classify, a semantic enclosure that establishes a subject or object. But unlike the word vowel, consonant is also an adjective. A consonant possesses the capacity to describe, to name, to tell us more. Adjectives parcel out description on states of being, in this way, they are inherently phenomenological. In 술래 SULLAE, Chun empties vowels of their sonic substance leaving behind traces of fragmented characters and differently shaped circles in their wake. They are stripped of breath and their symbolic value forming a new method of communication that reroutes expectations of what language, as we know it, can do and sound like. Like, ganggang sullae, the vowel is premised on the shape of a circle, but in 술래 SULLAE, Chun provokes us to think about what becomes possible beyond the circular structuring device, what becomes possible beyond the purview of the violent embeddedness of English and its colonial exigencies.

new moons, 2020, graphite on wall, surveillance mirror, 18 x 6 inches each
untitled (ㄷ), 2020, a functional concrete stool, courtesy of artist

Chun has noted that the moon that hovers above the ganggangsullae is yet another site of imperial conquest. In Art Forum, Chun states: “when I look up at it to feel comforted or to find solace, I’m reminded of colonial violence and an agenda that’s projected onto it. In that way, the moon also reflected how I see language.” Chun’s turn to consonants signals a reshaping of the colonial frame that does not disavow or idealize the legacies of imperialism on systems of communication, but instead highlights the tensions and obstructions produced in its shadows.

Featured image: 술래 SULLAE, 2020, single-channel version, courtesy of artist

Casey Mecija is an Assistant Professor in the Department of Communication & Media Studies at York University. Her current research examines sound as a mode of affective, psychic and social representation, specifically in relation to diasporic experience. Drawing on sound studies, queer diaspora studies and Filipinx Studies, her research considers how sensorial encounters are enmeshed and disciplined by social and psychic conditions. She is also a musician and filmmaker, whose work has received a number of accolades and has been presented internationally.

tape-reel

REWIND! . . .If you liked this post, you may also dig:

Blank Space and “Asymmetries of Childhood Innocence”  –Casey Mecija

Re-orienting Sound Studies’ Aural Fixation: Christine Sun Kim’s “Subjective Loudness”–Sarah Mayberry Scott

Tape Hiss, Compression, and the Stubborn Materiality of Sonic Diaspora–Chris Chien

Paths to Autonomy

Paths to Autonomy Edited by Noah Bremer & Vaida Stepanovaite Collection exploring the history and development of autonomous politics in Lithuania and Eastern Europe A path is created when a direction is taken, its production marks the imbrication of personal choice, communal action and subhuman (structural, historical, ecological) conditionings. We are at the same time the makers of our paths … Continue reading →

Paths to Autonomy

Paths to Autonomy Edited by Noah Bremer & Vaida Stepanovaite Collection exploring the history and development of autonomous politics in Lithuania and Eastern Europe A path is created when a direction is taken, its production marks the imbrication of personal choice, communal action and subhuman (structural, historical, ecological) conditionings. We are at the same time the makers of our paths … Continue reading →