Workflows

BT LITE

In case you want to explore gene expression across different datasets via a comparative visalization, BT LITE is a good starting point. It mainly allows you to visualize your data via

  1. a heatmap showing general dataset coverage across brain regions

  2. heatmaps of gene expression of selected genes on brain region and dataset level

  3. parallel coordinates of genome-wide average gene expression on brain region and dataset level

BT Lite overview

1. Overview heatmap

The general heatmap is the starting point of further investigations: It shows the distribution of image/sample count for datasets (rows) and brain regions (columns). The darker the higher the count, a white element indicates that no data is available for this combination of dataset and brain region. Datasets can be subdivided by the meta data categories age category, cell type, phenotype, genotype, strain or extended annotation. In this case, a row shows the dataset coverage per category groups.

BT Lite heatmap

By clicking on the parcellation browser (left side), you can subdivide or merge regions based on the brain ontology. If you hover over an element of your heatmap, you see the current brain region and the original sample region anottation. This information can be used to estimate its relevance for the user. Besides that, you get a summary of the image/sample count, the dataset category = name of the dataset plus maybe a meta data category and the amount of covered meta data categories per category.

BT lite region split

2. Gene expression heatmaps

To investigate the expression of selected genes, you (1) select specific entries of your heatmap by clicking on them, (2) select the “Expression of genes” tab and (3) select some genes. (4) Then you select what your rows and columns should show. Several heatmaps appear, on the left side (1) a big overview heatmap, on the right side (2) small multiples are visalized to see the details. The colours are dataset dependent as the values can’t be compared directly.

BT lite gene expression

3. Parallel coordinates

To analyse your data on a genome level, you (1) select some datsets and regions by clicking, (2) you select the genome-level analysis tab and then a parallel coordinates system appears. Every line is a gene. (3) For every dataset-region combination you can adjust the range you want to see. You also get a gene list of your filtered genes.

BT lite genome level

The technical background and more informations can be found in our publication (Ganglberger et al, 2023)

Browse Database

The Browse database tab can be a starting point for all other workflows. Here, one can browse all data in BrainTrawler’s database via a text search or by browsing through the different tabs.

Browse database

You can see your datasets, genes, networks, 3D images, celltypes and sampleregions. You see some information on the dataset (name, species, method, …). For some items (network and 3D image), it is possible to add/remove them to/from the Workspace List:

Star and eye

A click on the star icon next to an item adds/removes it to/from the Workspace List. The eye icon next to the star icon will add it to the Workspace too, but also visualizes it directly. This is further indicated by the eye icon in the Workspace List (the eye icons correspond to each other). If the eye icon is active, it means that an item is visualized in the current workflow, and therefore it is in the Viewer Items List. Or simple: Star Icon → Item to Workspace List. Eye Icon → Item to Workspace List and Viewer Items List.

Note

Items that you add as visible to Workspace List (with the eye icon) will be automatically set visible in the Network Query workflow. This is because the Network Query workflow is usually the next workflow after Browse Database, so this saves a few clicks.

Note

Currently only neuron related GO terms are in the database.

Select gene from Browse Database tab

Select gene from database

To add genes from the database to the workspace you do the following:

  1. Search for a gene and click on it.

  2. Scroll down on the right side until.

  3. Select your gene from a dataset and add it to the workspace.

Network Query

A network query can be conducted in the “Network query” tab.

Network query

Network queries, such as target or source connectivity queries can be executed as follows:

  1. Select a connectivity matrix (network) and a “Volume of Interest” (VOI). This VOI can be one or more brain regions and/or a brush drawing with a certain radius (see also Visual Queries). Brush drawings can be initiated via the “brush” button (initiating a new brush will remove all brushes/selected brain regions). The selected VOI then acts as an input for a target or source query on the selected connectivity matrix (network).

  2. Click the “Target Query” or “Source Query” Button, BrainTrawler retrieves the connectivity to all voxels that are either targets or sources from the selection, and then returns a query item (connectivity from/to VOI on voxel-level).

  3. The result will be rendered instantly in the
    1. Render View and

    2. as Connectivity Profile. This profile represents the cumulated connectivity to (target) or from (source) the selected VOI. The Query Item gets automatically assigned with the name and color of the largest anatomical brain region in the VOI. Different tabs allow the user to switch between profiles of visible queries/gene expressions.

Afterwards, you can continue and set a high-intensity VOI

High-intensity VOI

High intensity voi

In the connectivity profile (see also Profiles), one can define a VOI with high intensity and do a 2nd order network query:

  1. Select regions of interest (e.g. with high connectivity)

  2. Click on “Set high-intensity VOI”

  3. Choose a threshold in the voxel-level connectivity histogram pop-up, which adds all voxels within the region, and a connectivity above the threshold, as VOI.

Afterwards, you can perform a target/source query on the selection, the resulting second-order connectivity is visualized in 2D and 3D, while the associated Connectivity Profile is rendered again in a tab below the Render View.

1st and 2nd order network query

As shown in the figure, one can determine the origin of a 2nd order network query by having a view at the text above the connectivity profile.

1st and 2nd order network query

An alternative view of the profiles can be rendered by clicking on the “bars” - icon next to the tabs of visible profiles where higher order Connectivity Profiles will be shown below their originating Connectivity Profile.

Typical use case network query

The user selects a structural connectivity matrix in the Query Toolbar, and brushes a VOI in the Render View (for example a region with high gene expression). A click on the “Target Query” button executes the select VOI. The accumulated connectivity is instantly displayed in 3D and 2D and as Connectivity Profile. This process is repeated with a functional connectivity matrix. The user then compares both outgoing connections in the Render View and in the Connectivity Profiles.

Note

Initiating a new brush is (currently) to remove any selection of a VOI that has been made.

Network Analysis

To analyse single networks or to compare multiple ones, you go to the “Network analysis” tab.

Local connectivity (query items, as a result of the Network Query worflow) or global whole-brain connectivity (a network item directly visualized) can be analyzed and compared as region-level networks on different anatomical scales. Instead of the Query Toolbar in the Network Query workflow, it shows information and settings for the network visualization.

Network analysis

For every visible network item, a histogram of its edges weights shown. The colors of the histogram bars map directly to the visualized edges and can be set in the Viewer Items List. The color directly relates to connectivity strength. Since the number of edges increases quadratically with the number of nodes, one can apply thresholding on edges to hide weaker connections: The threshold can be set with the slider below the histogram.

There are also some further settings:

Scale Color Above Threshold: one can scale the color between the theshold and the maximum value.

Hide Source Nodes and Hide Target Nodes further allows to hide source or target nodes of a network if one wants to focus on these individual parts.

Network Nodes to Brush adds the voxels of all visible nodes of the network to the selection for a Visual Query.

There is also the possibility of a comparison and joint exploration of multiple graphs , which relate to the same anatomical parcellation. This has been realized by overlaying several graphs, i.e. simply rendering multiple edges between nodes. Since showing only two graphs even renders up to four arrows between nodes, one can use an overlap visualization (“Visualize Overlap” checkbox) to emphasize on connections that are strong in multiple networks. Here, only a maximum of two edges per node needs to be rendered (i.e. two for directed, one for undirected). Their weights are defined by the multiplication of all edges (at regionlevel) between those nodes. Unlike an overlap that is based on the presence of connections (i.e. it renders binary edge weights if they are above certain thresholds in different networks), it provides edge weights on a continuous scale to visualize contrast between weak/strong connections. If an overlap is computed, the histograms depict a formula of the multiplication of individual graphs. Cascading queries are treated as a single network, depicted by brackets in the formula , since they represent connectivity of different orders.

Network analysis two networks

So, summed up, to compare networks you have to:

  1. Click “show graph representation” for the networks you want to compare

  2. Click “visualize overlap”

Then, you can play around with your network as before.

Note

Query items from the Network Query worflow will be automatically set visible in the Network Analysis workflow. This is because the Network Analysis workflow is usually the next workflow after Network Query, so this saves a few clicks.

Gene Expression Query

Gene Expression Queries can be performed to see which genes are specific for certain brain regions. The workflow is similar to a Network Query.

Gene expression query
  1. Select a dataset and a “Volume of Interest” (VOI). This VOI can be one or more brain regions and/or a brush drawing with a certain radius (see also Visual Queries). Brush drawings can be initiated via the “brush” button (initiating a new brush will remove all brushes/selected brain regions). The selected VOI then acts as an input for the gene expression query.

  2. Select one of the four available queries Expression query, Region-Specificity query, celltype-specificity query, Enrichment query. More details are given below.

  3. The result will be rendered instantly in the
    1. Render View and

    2. as a Gene list giving the name of the gene (name, fullname and entrez id), the species, the probability of being expressed, the value of the expression and the rank.

There are four different types of Gene Expression Queries that represent different types of normalization:

  • Expression Query: Genes ranked by their mean gene expression within the VOI (for example: in case of the Allen Mouse Brain Atlas: expression energy)

  • Fold-Change to Brain Query: We compute the fold-change to the rest of the brain: mean gene expression within the VOI DIVIDED by the mean gene expression of the whole brain.

  • Celltype-Specificity Query: This query can be used to see how specific the expression of a certain celltype is. Note that this query makes only sense, if a filter based on a celltype is applied. In this case, the mean gene expression within the VOI (similar to the Expression Query) is computed, and normalized by the expression over all celltypes (filters not concerning the celltype are still applied): mean expression in the brushed region DIVIDED by the mean expression of the brushed region for all celltypes.

  • Enrichment Query: Generalization of celltype-specificity query. This query can be used to see how specific the expression of the selected filter categories is. Note that this query makes only sense, if a filter is applied! In this case, the mean gene expression within the VOI (similar to the Expression Query) is computed for all samples of the selected filter/categories, and normalized by the expression over all samples (filters not applied): mean expression in the brushed region for the filter/categories DIVIDED by the mean expression of the brushed region for all samples.

Note

The Gene Lists contain only genes with expression/specificity>0! That’s why the do not always show all genes in the database.

Note

If datasets do not span the whole brain, z-score has been computed for the regions of the respective dataset (and not the whole brain).

You can view genes and select them to the workspace by clicking on the eye/star icon in the gene list. That way a new tab for the gene appears.

Select genes from a gene expression query

Then, similar to Network Query second order queries can be done based on high intensity VOI:

High intensity voi gene expression

Gene Expression Analysis

Gene expression query list items (i.e. lists of genes sorted by gene expression) reveal which genes are specific for a certain VOIs. Since every gene list consists of several thousand genes, an efficient comparison and combined filtering of these lists can be done with a parallel coordinate system.

Note

You have to perform Gene Expression Queries first before you can perform the gene expression analysis in this workflow/tab!

Gene expression analysis

A parallel coordinates system allows the filtering of multiple gene lists from different brain regions by their gene expression. Each axis represents a gene expression query, and as a consequence the gene expression for the query regions (the color of a label indicates the color of largest region in the query’s VOI). Each line represents a gene. A selection/filtering of genes with specific gene expression patterns can be made drawing brushes (the grey boxes) on an axis, you adjust the range per dataset. You can remove the filters with the “Reset Brushing” button (they are also reseted when tabs are switched).

Brushed genes in the parallel coordinate system are automatically filtered for in the table below. The table can be also exported as tsv (tab separated values) with the “.csv”/”.tsv” button of the table.

The parallel coordiantes system and the table below only show genes without missing values (NA), but they can be enabled via the checkbox. In the parallel coordiantes system, missing values will be interpreted as 0.

Note

Query list items from the Gene Expression Query worflow will be automatically set visible in the Gene Expression Analysis workflow. This is because the Gene Expression Analysis workflow is usually the next workflow after Gene Expression Query, so this saves a few clicks.

Global Gene Filter

In the Global gene filter tab, you can define global gene list filters for all tabs, except for BrainTrawler LITE. All results, no matter if it is the text search in the Browse Database tab/workflow, or results of a Gene Expression Query, will be filtered by the genes you select in the “Globally Used Genes” table. You can either select the genes manually in the table, or upload a gene list (as csv/tsv), containing gene identifier (entrezid, ensemblid, symbols) in a column. These can then be matched with the database.

global gene filter

Afterwards, only these selected genes will be displayed, e.g. for a gene expression analysis you have only your (in this case two) selected genes:

gene expression analysis with global gene filter

When the global gene filter is active, the tab “Global Fene Filter” is highlighted in orange.