Querying multiple genes (Batch Query)
On this page:
- Multiple string search
- Multiple string search on a specific filter
- How to convert a column of gene symbols in a spreadsheet to a search string
A batch query searches for several items at once, such as gene symbols or RIDs (Record IDs that uniquely identify every record in GUDMAP).
Multiple string search
Our search feature supports pattern matching (based on PostgreSQL regular expression syntax).
To search for any one of several symbols, such as gene names (e.g. COX1), separate the symbols with the | character.
A search string built this way and entered in the main search box returns all the records in which at least one of the listed symbols (e.g. Six2) occurs as a substring of any associated metadata.
- There should be no spaces between | and the individual search strings.
- The search string is not case sensitive.
- You can see the search syntax in the interface by hovering over the magnifying glass icon next to the search box to view the tooltip.
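The matching rules above can be illustrated with a minimal Python sketch. It uses Python's `re` module as a stand-in for the site's PostgreSQL-based matching, so details of the two regular expression dialects may differ; the helper `matches_any` and the sample records are hypothetical and exist only for this illustration.

```python
import re


def matches_any(search_string: str, text: str) -> bool:
    """Return True if any |-separated alternative occurs in text, ignoring case.

    Assumption: Python's re module only approximates the PostgreSQL
    regular expression matching used by the actual search box.
    """
    return re.search(search_string, text, flags=re.IGNORECASE) is not None


# Hypothetical metadata strings, made up for this example.
records = ["ATP6 probe", "cox1 antibody", "Six2 section"]

# Substring match on either symbol, regardless of case.
hits = [r for r in records if matches_any("ATP6|COX1", r)]

# Spaces around | become part of each alternative, so " COX1"
# (with a leading space) no longer matches the bare text "COX1".
with_spaces = matches_any("ATP6 | COX1", "COX1")
without_spaces = matches_any("ATP6|COX1", "COX1")
```

Here `hits` keeps the first two records, and the last two lines show why spaces next to `|` should be avoided: they are treated as literal characters that must also appear in the metadata.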
Multiple string search on a specific filter
If you want the search to be limited to only a certain property (e.g. the Gene Symbol on the Gene page), you will need to apply the search string to the specific search panel for that filter (see below) and then select the attributes you want to filter on.
How to convert a column of gene symbols in a spreadsheet to a search string
Most spreadsheet applications can concatenate or join the text in a range of cells. For Open Office and Google Sheets, use the following formula to create the search string.
TEXTJOIN(delimiter, ignore_empty, text1, [text2, ...])
Where:
- delimiter: A string, possibly empty, or a reference to a valid string. If empty, the text is simply concatenated.
- ignore_empty: A boolean; if TRUE, empty cells selected in the text arguments are not included in the result.
- text1: Any text item. This could be a string, or an array of strings in a range.
- text2, ... [OPTIONAL]: Additional text item(s).
Sample usage:
TEXTJOIN("|", TRUE, "ATP6", "COX1")
TEXTJOIN("|", TRUE, A2:A101)
where A2:A101 is the cell range containing the gene symbols.
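If the gene symbols are already available outside a spreadsheet, the same join can be done in a few lines of Python. This is only a sketch of the idea behind the TEXTJOIN formula with ignore_empty set to TRUE; the function name `build_search_string` is hypothetical, and unlike TEXTJOIN it also trims surrounding whitespace so that no stray spaces end up next to the | separator.

```python
from typing import Iterable, Optional


def build_search_string(symbols: Iterable[Optional[str]], delimiter: str = "|") -> str:
    """Join gene symbols into one search string, skipping empty entries.

    Hypothetical helper: mirrors TEXTJOIN(delimiter, TRUE, range), and
    additionally strips whitespace around each symbol.
    """
    cleaned = []
    for symbol in symbols:
        if symbol is None:
            continue  # treat missing cells like empty cells
        symbol = symbol.strip()
        if symbol:
            cleaned.append(symbol)
    return delimiter.join(cleaned)


# Example column with an empty cell, a padded value, and a missing value.
column = ["ATP6", "", " COX1 ", None, "Six2"]
search_string = build_search_string(column)
print(search_string)  # ATP6|COX1|Six2
```

The resulting string can be pasted into the search box as is. Symbols are not escaped here, so a symbol containing regular expression special characters would need extra handling.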