THE SEEM

Field experience as a cultural mediator led me to the hypothesis about how ICT software was capturing narratives. In the experiment, participants watched a video and described it three times:

stage 1 out loud in Acholi
stage 2 written on a mobile device in Acholi
stage 3 out loud in English

I predicted a pattern of similarities between the the written Acholi version and the oral English version. I thought that because the software had been developed from a western cultural perspective, that even when the interface was translated for users, they would have to get into an English mindset to engage with the logical organization of the software. And because of this effort at a cognitive level, the narrative they would produce via the ICT would be more similar to the oral English version and less similar to the oral Acholi version. In this way it could be said that the Acholi narrative was disrupted.

hypothesis: that narrative given via ICT in Acholi would be more similar to performance in English than to oral Acholi. [stated as a null hypothesis: The oral Acholi version will resemble the ICT Acholi version]

The apples and oranges bit that makes statistical modeling a challenge is two-fold. First, the experiments I adapted were comparing only speech (similar to my stage 1 and stage 3).... more like apples to apples. And secondly, each of their participants provided similar narrative content, at least, similar enough to compare... so here, oranges to oranges. In my case, even though I controlled the topic, the video, one participant remembered seeing a dispute instigated by a woman while another remembered seeing a group of government thugs enforcing curfew. Furthermore, some participants provided only a few phrases while others talked for several minutes, many paragraphs of data. Their descriptions of the video clip are barely in the same galaxy.

The plan was to compare each narrative stage by analyzing frames. A frame refers almost to a fragment or phrase constructing the narrative of, in this case, an event. Take for example the frame, I saw fighting and I saw two men fighting. While similar, these would be considered two different ways of framing because of the added detail of the number of actors, the two men.

The frames, such as giving a general overview followed by reference to who was in the frame-- I saw fighting. I saw people fighting-- would be considered two frames. If the details in these frames are not markedly different between any two stages, then they are counted as the same even if the syntax is different such as I saw a fight vs. They are fighting.

When differences occur, they are not only counted but ranked for importance. Based on the narrative structure in Acholi (used as a baseline/norm and derived from this study as well as literature), the general category, then the participants, then details of actions and location are given hierarchical importance. So for example, if two versions match until a detail of location, if one mentions a location and the other does not, they may still be considered a match if the general scene category and description of participants and actions match. If the locations differ entirely, one event is recalled as being in a market and one at a taxi stage, then they will likely be considered different narratives. For shorter narratives, most if not all frames and details must match.

Another consideration is tone. Throughout stage 1 and 3, the space that an oral modality gives for adding extra words, words which convey the (un)certainty of the speaker about his/her memory play a role in strength of the witness statement. However, both stage 2a and b do not afford this same linguistic wiggle room. There were built-in points to express doubt in stage 2b, and they are a point of focus in analysis. Stage 2a was an open SMS format, so it was up to the participant to inject the intended level of (un)certainty. Divergence in tone, in expressed level of doubt, constitutes a means for differentiating narratives as well.

For stage 2 (a and b), it gets a bit trickier to compare written results with spoken... and sometimes only a ticked box for yes or no. But this is the meat of the question. Does the ICT format adequately capture the categories and concepts for the event frames? (Many participants in my experiment were reluctant to hold the device, type themselves, and often spoke the answers aloud which I recorded with an audio device and typed for them. The data collected via the mobile device in stage 2 was not, therefore, strictly written. Much of it was still spoken.)

For stage 2b to be judged a match with another stage, the answers should convey similar information to that given in one of the oral narratives without contradicting or adding new information. If the reader had only the information from stage 2b, could s/he reasonably imagine a scene as told in one of the other narrative versions? If one of the oral narrative versions has several other frames or details not captured by stage 2b, then no, but if the sense of the event framing such as ‘theft with violence’ was captured by the answers to stage 2b’s question series, then it can be reasonable concluded that stage 2b adequately captured that narrative. (If it sounds complicated, you're not wrong. As I have said before, this was a first attempt at bringing a problem to the surface so it can be better understood, studied, and addressed... so far, lots of good indicators plus heaps of ideas to improve future trials.)

Because of the ordering of the stages, stage 2a often repeated much of what was said in stage 1. For psychology experiments, the effects of ordering are considerable and are usually countered by mixing up the stages; however, in this case, the role of priming was anticipated. The structure of the experiment captures the real-life field experience where an individual recalls an event orally and then creates a written record of that event with ICT software, often with the help of a trained information collection specialist. Priming effects would counter my hypothesis, in other words, I almost stacked the deck in favor of my own hypothesis failing. So if the results show my hypothesis still has value, there must be a very strong force at work.

Of the instances that were considered 'ICT success,' most came from participants who followed the stage 2a path. This may have been because it was a quick experiment and easy to simply repeat in written form what you had just said out loud. It is fresh in your mind. The differences came in the form of tone as well as more obvious detail and frame shifts. The tone change is interesting because in the oral account, the speaker may have said, I saw some fighting like maybe there was this guy who caused a problem or maybe it's a taxi conductor wanting change.... There was doubt and alternatives expressed. In stage 2a, the version was clear and direct. I saw one guy hit another guy in the street. The versions often make it sound much more like the participant is sure of the identity and ready to accuse someone to the police; whereas the oral version is not accusatory and gives a couple of possible scenarios for what prompted the chaos. If I only read the ICT version I would believe there was an actionable threat, a report of violence that needed a response. If I heard the oral version, I would not be as concerned. The aggregation of this type of report, stripped of the measured tone of the speaker can heighten perceived threat levels unnecessarily.

I will spend more time on this topic in further posts as well as talk about the results of my frame analysis. (Two other colleagues are reviewing my results. Independent evaluation.) But preliminary conclusions are that my hypothesis was correct. So that's cool. Other results point towards the strong division between oral and written narrative structures, something we all kinda know intuitively and there is a ton of research on how speaking is different than writing, so it's reassuring that results stayed with this pattern... the interesting parts come in looking at

I'm pretty sure there aren't muscles in my brain. But I swear I can feel teeny tiny sinuous straining in my head when I stare at my data. Raking over the pages. Dicing it up. Piecing it back together in new combinations. Milking every ounce of remotely interesting reportable information from it. Participants 5 and 9 mentioned the word BLUE. great. no idea what that means. yet.

Two things are in the back of my mind: Daniel Dennett's latest book, Intuition Pumps and Other Tools for Thinking, because one of the early chapters is about failure. Strategic flops. He talks a lot about the toil of generating ideas. And it does feel like a rare and wonderful thing that we (people who have the luxury to just sit around and ponder) get a problem that really strains those brain muscles. Data analysis has also brought to mind an image of sculptors from another age who could see their design within the marble while slowly chipping away toward it. I am reasonably confident that somewhere within all the data I've collected, if I sift and sort enough, the description I envisioned at the start is poised to emerge. What I don't know is what tools I need to get there. And then again, if my first pass at designing an experiment like this wasn't quite right, was it a strategic flop? Did it generate anything that will lead me in the right direction next time or provide a least (fingers crossed) a partial picture?

When I initially designed my experiment, (The experiment was conducted in three stages. Participants were asked to watch a short video on a laptop from Youtube titled, ‘Crazy Nigerians’ which they might watch alone or in pairs. They were asked to describe what they had seen in Acholi, ‘Lok ma ineno/ Tell me what you saw,’ then they were handed a mobile device to answer question(s) about the video, and finally they described the scene again in English.) I knew that it might not work (the risk of all experiments), but I hoped it would provide a certain kind of data, a certain kind of description of the situation I was investigating. Unlike previous qualitative work, this experiment might be more useful to software engineers. Additionally, it narrows the focus from language or communication generally to event construal, or a type of cognitive connection to language. How we conceptualize an event in time and space, who was there, etc... how this is stored in our memory and recalled through language has been shown by previous bilingual research to vary by language, but what happens when ICT is thrown in the mix?

As mentioned in previous posts, I modeled my experiment on the work of several others in the field of cognitive linguistics. In particular, Aneta Pavlenko (the new president of the American Association for Applied Linguistics) and Christiane von Stutterheim an expert on narrative. Once my results take shape, I will integrate comments from members of the software development community. Just as I have done with earlier papers, writing to engage several disciplines is one of the biggest challenges. I am by no means trying to please everyone, but at least get my point across and be prepared to answer reasonable criticism.

In subsequent posts, I will be addressing initial themes as well as analysis techniques since this is a new, although adapted, method. So far... the emergence of an alternative hypothesis indicating narrative disruption, the identity of the attacker in the video, and the conveyance of doubt/uncertainty have emerged as the main themes.... Stay tuned.

Thursday, 19 December 2013

Original and Extra...Complicated

Monday, 2 December 2013

Apples and Oranges and Acholi

Seams

Popular Posts