Visual Methodologies #2

The digital research method of creating image grids

Contributed by:

Carlo de Gaetano, researcher at Lectorate Visual Methodologies.

Summary

Visual methodology is a form of qualitative research that relies on artistic mediums to produce and represent data. It differs from other research methods in that it isn’t immediately apparent what you will be able to do with your collected data.

Visual methodology is usually applied during the exploratory phase of a research or design project. The collected data gives you insight into how your research topic is represented on the internet. It can also give you a different perspective on the topic, which can help you create new imagery around it and shed new light on your research. To apply this methodology you will need a couple of tools. In this example we explain the following:

DMI is used to scrape images from Instagram. Besides the images, it also collects the metadata that comes with them. Alternatives include Apify and Phantombuster. These scrapers can run into issues because Instagram tries to prevent external applications from scraping its data, which is why scrapers need to be updated frequently.

Memespector is used to add tags to your images, such as ‘font’, ‘girl’ or ‘beach’, which you will later use to create a network of labels and images. The credential .json file is necessary to run Memespector.

Gephi allows you to visualize your collected data using the tags that Memespector created.

“When thinking of images online as digital objects we have to think beyond the single image. They are embedded in a digital context and they never travel alone.”

Method goal

As mentioned before, visual methodology is a form of qualitative research that relies on artistic mediums such as film, photography, illustrations and sculptures to produce and represent data. Besides revealing information about what the artistic medium captures, it also reveals information about the artist who created it, the context in which it is presented and other metadata that can be extracted from it.

Once you have explored this method you are left with a clean overview of your collected data, both visually and textually. You will have an overview, in the form of patterns and clusters, of what lives on the internet regarding your research topic. It allows you to gain a better understanding of how people online view the topic and how different subgroups interpret it. With all of the collected data you will be able to not only see your research topic from a different perspective, but also decide the further direction of your research project.

When do you execute this method?

Visual methodology is mostly used in the exploratory phase of a research or design project. It can also be applied later in your project, once you have analyzed the incoming data, to define the direction of your project.

How does it work?

Necessities

For this method you will need a computer with Excel, Google Sheets or any other program where you can import .csv files. Other than that you will want to have figured out which research topic you would like to explore, as well as the following software:

  1. DMI Instagram Scraper
    This software is used to download image data sets from Instagram.
  2. Memespector
    This software is built on top of Google Vision API, a computer vision algorithm that automatically tags images by looking at their content.
  3. Google Cloud Credential .json
    An account and this .json file are necessary to run Memespector.
  4. Gephi
    This software allows you to visually explore the images taken from the scraper. It does this based on the tags created in Memespector and will give you groups of images with similar content.

 

Optional is Adobe Illustrator or an online concept board where you can annotate your visual data.

Execution

Once all the software is set up you can start with the DMI Instagram Scraper.

  1. Fill in the # and/or @ you want to explore images from in the Query box.
  2. Enter the number of images you would like to gather data from. (Test with 5 images in order to have results in one minute.)
  3. Select ‘Photo files’, as that is the data you want to collect.
  4. Enter your preferred file name.
  5. Choose a location where you want to save the file.
  6. Press ‘Start scraping’.

 

As this is open source software, it might not work the first time. Try restarting and/or waiting fifteen minutes. An alternative is CrowdTangle; however, this is a paid product.

You are now left with a .csv file which you can open in Excel, Google Sheets, or any other program of your choice that allows .csv import.

  1. Import your data.
  2. Select all cells that contain data and apply a filter to it by simply pressing the filter icon. This icon looks like a funnel and can be found on the right side of the toolbar.
  3. Sort the number of likes from Z to A. This allows you to order the images from most popular to least popular.
  4. To make the sheet visual, add a new column next to your thumbnail URLs. In this new column enter the function =IMAGE(X), replacing X with the cell that contains the image URL, for example =IMAGE(L1). Drag this function down from your first image to the last one in the list. To make the images larger you can resize the rows.
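
The sorting and IMAGE steps above can also be sketched in a few lines of Python. This is only a minimal illustration, not part of the DMI Instagram Scraper: the column names `likes` and `thumbnail_url` and the sample rows are assumptions, so check the header row of your own .csv file before reusing it.

```python
import csv
import io


def rank_by_likes(rows, likes_col="likes", url_col="thumbnail_url"):
    """Sort rows from most to least liked and add an IMAGE formula per row.

    The column names are hypothetical; adapt them to your own export.
    """
    ranked = sorted(rows, key=lambda r: int(r[likes_col] or 0), reverse=True)
    for row in ranked:
        # A spreadsheet cell containing =IMAGE("url") shows the picture itself.
        row["image"] = '=IMAGE("{}")'.format(row[url_col])
    return ranked


# Made-up sample data standing in for the scraper's .csv file.
sample_csv = """id,likes,thumbnail_url
a,12,https://example.com/a.jpg
b,340,https://example.com/b.jpg
c,57,https://example.com/c.jpg
"""

rows = list(csv.DictReader(io.StringIO(sample_csv)))
ranked = rank_by_likes(rows)
for row in ranked:
    print(row["id"], row["likes"], row["image"])
```

Doing this in the spreadsheet itself is usually easier; the sketch mainly shows what the ‘Z to A’ sort and the IMAGE column do to the data.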

 

All of your data is now placed in a nice spreadsheet, but what can you do with this?

  1. Locate the column which contains the body of the posts. 
  2. Filter by condition and choose ‘Text contains’. 
  3. You now have to decide which keywords you want to explore. This all depends on your research topic.
  4. Once filtered by your chosen keyword you can select the row with the thumbnail URL and the image itself and place this on a new sheet.
  5. You can decide to hide the columns which contain the thumbnail URL. Be careful not to delete them, as this will delete the image as well.
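
The keyword filter described in these steps can be sketched as follows. The field names `caption` and `likes`, the sample posts and the choice to ignore upper and lower case are all assumptions made for illustration.

```python
def filter_by_keyword(rows, keyword, text_col="caption"):
    """Keep the rows whose post text contains the keyword (case ignored)."""
    needle = keyword.lower()
    return [r for r in rows if needle in (r.get(text_col) or "").lower()]


def top_images(rows, keywords, n=3, likes_col="likes", text_col="caption"):
    """Return the n most liked posts for every keyword."""
    overview = {}
    for keyword in keywords:
        matches = filter_by_keyword(rows, keyword, text_col)
        matches.sort(key=lambda r: int(r[likes_col]), reverse=True)
        overview[keyword] = matches[:n]
    return overview


# Hypothetical posts; in practice these rows come from the scraper's .csv.
posts = [
    {"id": "a", "likes": "12", "caption": "Flood in the city #climate"},
    {"id": "b", "likes": "340", "caption": "Melting ice #Climate #arctic"},
    {"id": "c", "likes": "57", "caption": "Beach day"},
]

overview = top_images(posts, ["climate", "beach"], n=2)
for keyword, matches in overview.items():
    print(keyword, [m["id"] for m in matches])
```

Each keyword ends up with its own short list of top posts, which mirrors placing the filtered rows on a new sheet.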

 

As a result of these steps you now have an overview of the top images from each keyword. From here on you can explore what it is that you are seeing and also what it is that you are NOT seeing. 

The next part of this method is done in Memespector, Table2Net and Gephi.

  1. Open Memespector and make sure your Google Credential .json file is placed in the right box.
  2. Check only ‘Label’.
  3. Select ‘Folder on this computer’ and choose your folder with your images scraped from the DMI Instagram Scraper.
  4. Pick a name for your files.
  5. Press ‘Invoke Google Cloud Vision API’.

 

You are now left with a .csv file with the tags that Memespector has given to your images. To create a network of images and labels you will use Table2Net.

  1. Import the right file which you created through Memespector.
  2. Type of Network: bipartite.
  3. First type of nodes: images, one expression per cell.
  4. Second type of nodes: tags, multiple items per cell, separated by ;.
  5. Download the .gexf file.
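
To make the term ‘bipartite’ more concrete, the sketch below builds the same kind of network by hand: one type of node for images, one for tags, and a link for every tag an image received. It is not how Table2Net is implemented, and the column names `image` and `labels` are hypothetical stand-ins for the columns in your Memespector file.

```python
from collections import Counter


def build_bipartite(rows, image_col="image", labels_col="labels", sep=";"):
    """Turn tagged rows into image-to-tag edges and count each tag's use."""
    edges = []
    occurrences = Counter()
    for row in rows:
        image = row[image_col].strip()
        # One cell holds several tags, separated by ';'.
        labels = [l.strip() for l in row[labels_col].split(sep) if l.strip()]
        for label in labels:
            edges.append((image, label))
            occurrences[label] += 1
    return edges, occurrences


# Made-up rows in the shape of a tagged image table.
tagged = [
    {"image": "a.jpg", "labels": "beach;girl"},
    {"image": "b.jpg", "labels": "beach;font"},
    {"image": "c.jpg", "labels": "font"},
]

edges, occurrences = build_bipartite(tagged)
print(edges)
print(occurrences.most_common())
```

Images never link to other images directly; they only become neighbours through the tags they share. The occurrence count of a tag is the kind of value you can later use to size nodes.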

 

The .gexf file can be opened in Gephi. For this software to work you need to have the latest version of Java running on your device. You can download this here.

  1. Select the .gexf file.
  2. Graph Type: Undirected.
  3. Deselect ‘Create missing nodes’.
  4. Choose ‘Append to existing workspace’ and press OK.
  5. Recolor the nodes by pressing the palette icon, then ‘Partition’ and ‘Type’. Once you have chosen a color, press Apply.
  6. Resize the nodes by numerical value by pressing the size button > ranking > occurrence count and play around with the sizes. Always press Apply to visualize your changes.
  7. To adjust the size of the labels based on the size of the node you can press the little triangle in the bottom right corner. Under ‘Labels’ you select ‘Node’ and set ‘Size’ to ‘Node Size’. Now you can play around with the size of the labels.
  8. To add images, you need to add a new column in the ‘Data Laboratory’ named ‘Image’. Select the column with the image file names and press ‘Copy data to other column’ and choose ‘Image’. 
  9. Download the Image Preview Plugin from Gephi and specify the path where your image folder is located on your computer.
  10. To make the images more visible you can filter the visuals. You can do this under Filter > Library > Attributes > Partition, drag ‘Type (Node)’ to Queries and select ‘Image_BaseName’. End by pressing ‘Filter’.
  11. Remove the image labels by selecting all rows in the ‘Data Laboratory’ and deleting the Label attribute on the left hand side. 
  12. The last step is to render the visual as an image. You can do this under the ‘Preview’ tab. Check ‘Render nodes as images’ and click ‘Refresh’.

 

All these steps will leave you with a .svg file full of images, which can look very chaotic. The next step is to either print this or open it in Adobe Illustrator and analyse it. Look for clusters that spark your interest and mark them.

What kind of data can you collect?

Images travel around the internet with certain metadata. This data includes hashtags, timestamps, likes, the picture itself and, depending on the platform, mentions of other users, captions, comments and so on. Images are part of a larger network and this method can help find links between different images. A single image may become closer to another because they are posted with the same hashtag, for example.
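
As a small illustration of how a shared hashtag brings two images closer together, the sketch below pulls hashtags out of captions and lists every pair of posts that has at least one hashtag in common. The field names `id` and `caption` and the sample captions are assumptions.

```python
import re
from collections import defaultdict
from itertools import combinations

HASHTAG = re.compile(r"#(\w+)")


def shared_hashtag_links(posts, text_col="caption", id_col="id"):
    """Map each pair of posts to the set of hashtags they share."""
    by_tag = defaultdict(set)
    for post in posts:
        # Lowercase first so #Climate and #climate count as the same tag.
        for tag in HASHTAG.findall(post[text_col].lower()):
            by_tag[tag].add(post[id_col])
    links = defaultdict(set)
    for tag, ids in by_tag.items():
        for first, second in combinations(sorted(ids), 2):
            links[(first, second)].add(tag)
    return dict(links)


# Hypothetical captions for three posts.
posts = [
    {"id": "a", "caption": "Flood in the city #climate #water"},
    {"id": "b", "caption": "Melting ice #Climate #arctic"},
    {"id": "c", "caption": "High tide #water #climate"},
]

links = shared_hashtag_links(posts)
for pair, tags in sorted(links.items()):
    print(pair, sorted(tags))
```

The more hashtags two posts share, the stronger their link, which is one simple way to read ‘closer’ in a network of images.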

Visual methodologies can, when the tools work correctly, process a large amount of data very quickly, which can then be collected and observed thoroughly. With the right tools, these methods allow you to filter and browse the top results, and to see how these change over time. Another interesting way of looking at the data is to figure out what is NOT in the top results; maybe you expected something particular to be there.
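
One way to see how the top results change over time is to group the posts per month and keep the most liked ones in each group. The sketch below assumes ISO-formatted timestamps and the hypothetical field names `timestamp` and `likes`; real exports may store dates differently.

```python
from collections import defaultdict
from datetime import datetime


def top_per_month(posts, n=1, ts_col="timestamp", likes_col="likes"):
    """Return the n most liked posts for each calendar month, oldest first."""
    buckets = defaultdict(list)
    for post in posts:
        month = datetime.fromisoformat(post[ts_col]).strftime("%Y-%m")
        buckets[month].append(post)
    return {
        month: sorted(items, key=lambda p: int(p[likes_col]), reverse=True)[:n]
        for month, items in sorted(buckets.items())
    }


# Made-up posts spread over two months.
posts = [
    {"id": "a", "likes": "12", "timestamp": "2021-03-04T10:00:00"},
    {"id": "b", "likes": "340", "timestamp": "2021-03-20T18:30:00"},
    {"id": "c", "likes": "57", "timestamp": "2021-04-02T09:15:00"},
]

timeline = top_per_month(posts, n=1)
for month, items in timeline.items():
    print(month, [p["id"] for p in items])
```

Comparing the months side by side also makes it easier to notice what never reaches the top.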

Other interesting data you can collect with this method is a view of different clusters of images which are popular within your research topic. This can give you very useful insights which you can use further on in your research project.

How long will it take?

The time spent on this method can differ tremendously. Not only because you can do as little or as much as you like, but also because it depends on how well the technology is working in your favour. Coming back to the former, this method can be done on several different levels. Simply scraping the images, putting these in a spreadsheet and analysing those images can by itself give very good insights. If you want to dive deeper into the data it is possible to work with Memespector and Gephi, which will of course add more time to the process.

Just collecting the data will take about an hour. Observing it can take anything between one hour and several days or weeks, depending on how much data you want to analyse.

Level of difficulty

Reading through all this information and all the steps a person has to go through, this method can quickly sound very complicated. However, once you get the hang of it, it does become easier.

A lot of the programs used in this method are open source, which means you can download and use them for free. It also means they are usually buggier than paid programs, where buyers expect support from the maker.

Designers usually do not work with spreadsheets and big files filled with data in their day-to-day work. This can make it difficult to get started, especially because you have to follow the steps precisely or the method might not work. This feeling comes back multiple times when moving on to the next program.

This method is definitely manageable, especially if you follow the guide above. But it does take some time and research to really understand the steps you are taking. You can build an amazing network of connected images and keywords or hashtags, but don’t expect it to work immediately. Just like Rome wasn’t built in a day, your image network won’t be either.

Pros and cons

So what are the actual pros and cons of this method? Let’s list them.

Pros

  • You get a visual view of images, which makes further research easier.
  • Once you have the network filled with pictures, analyzing it can be quite some fun! You get to look at the pictures, see connections, and you might even find pictures you definitely did not expect to find!
  • Luckily, the work you have to do in a spreadsheet is limited to a few very manageable steps.

Cons

  • Since most of the software used is open source, it can be quite buggy and there is no support system.
  • The interfaces of the software are not always easy to use, so it can be difficult to get used to them at first.
  • If you have an older PC or laptop, Gephi might have a difficult time running smoothly on your device, which can mean you’ll have to edit its configuration file yourself and change a few settings to give the software more memory to work with.

Evaluation and results

Overall, this method is very interesting. You get to dig deep into the data and are left with a very visual overview. This is not usually the case when working with big data files, where the most visual it gets is a line graph! It is definitely worth taking the time for, because you can find many wonderful insights in the result. It just takes some time to get there.

These results can be hugely beneficial for your research project, especially in the exploratory phase. They can help you determine in which direction to continue. On the other hand, if you have already chosen a specific direction, visual methodologies with Gephi can give you valuable insight into the topic you are researching.

Related methods

Photovoice

Photovoice is a method in which participants take and collect photos themselves. These images revolve around a certain theme and are meant to spark a discussion, which is followed by a mapping session done by the participants. In a way this means that you do manually what Gephi does by looking at the images. It also allows you to look at the meaning behind an image instead of just its looks.

Inspiration Wall

With an inspiration wall, you organize ideas during a project. It almost looks like a physical alternative to Gephi. You print out pictures and other inspirational ideas that you can then elaborate on. This is a good way to collaborate with users and is therefore more co-creation than data analysis. If working with Gephi is too complicated or boring, you can see this as a good alternative method.

Trend Analysis

Instead of scraping Instagram and using software, all of this can be done manually. But even when you do use the software, you can apply it with different goals. Instead of using visual methodologies as a way to find out the ‘known unknowns’, you can also use them to analyse trends and find connections. Afterwards, you can dive deeper into these trends and continue your research from there.

Projects worth mentioning

Project All Gone

‘Project All Gone’ is a series of experiments with artificial intelligence (A.I.) that builds on an existing collection of climate fiction to create much needed new climate imaginaries. The researchers chose A.I. because co-authoring with A.I. gives them the chance to break free from major existing narratives and allows them to refresh their imagination of the future with a changing climate. The researchers have already researched climate change and climate scepticism as it is discussed and depicted on the web and across platforms such as Wikipedia and social media. They have looked into and worked with climate imagery, climate movements, NGOs and how people tell stories about the climate. Central to all climate-related discussions are future projections: Where are we headed if we don’t change our ways? What will disappear? Which parts are the most vulnerable to the effects of climate change? What is at stake for the near and distant future? Most of these projections created in scientific literature, journalism, or the entertainment industry are presented to us over and over again until they stick.

Turning to the birds

‘Turning to the birds’ is a podcast that is derived from the All Gone Project. It is written by A.I. and edited by humans at the Visual Methodologies Collective. The collective generated lists of popular works of climate fiction, which allowed them to let the A.I. write a short story by entering dates in the future. The machine was then able to complete a story for that specific date. It was decided that humans would correct the draft, only slightly, for the sake of legibility. Text-to-speech technology was used to turn the diary into a podcast series. You can listen to this on several different platforms such as Apple Podcasts or Spotify.

Aidan Omwando, Beate van Garderen & Iris van Ollefen