 * Jalview - A Sequence Alignment Editor and Viewer ($$Version-Rel$$)
 * Copyright (C) $$Year-Rel$$ The Jalview Authors
 * This file is part of Jalview.
 * Jalview is free software: you can redistribute it and/or
 * modify it under the terms of the GNU General Public License
 * as published by the Free Software Foundation, either version 3
 * of the License, or (at your option) any later version.
 * Jalview is distributed in the hope that it will be useful, but
 * WITHOUT ANY WARRANTY; without even the implied warranty
 * of MERCHANTABILITY or FITNESS FOR A PARTICULAR
 * PURPOSE. See the GNU General Public License for more details.
 * You should have received a copy of the GNU General Public License
 * along with Jalview. If not, see <http://www.gnu.org/licenses/>.
 * The Jalview Authors are detailed in the 'AUTHORS' file.
<title>UniProtKB query fields</title>
<strong>UniProtKB query fields</strong>
<p>Supported query fields for searching specific data in UniProtKB (see also <a href="text-search">query syntax</a>).</p>
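<p>As a rough illustration of how the field queries listed in the table below are composed, the following sketch builds query strings of the form <code>field:value</code>, range values of the form <code>[low TO high]</code>, and terms joined with <code>AND</code>. The helper names <code>field_query</code>, <code>range_value</code> and <code>all_of</code> are hypothetical and are not part of Jalview or UniProt; the sketch only assembles text and does not contact any service.</p>

```python
def field_query(field, value):
    """Return one 'field:value' term (hypothetical helper).

    Values containing spaces are wrapped in double quotes, unless they are
    already a bracketed range or a parenthesised sub-query.
    """
    value = str(value)
    if " " in value and not value.startswith(("[", "(")):
        value = '"' + value + '"'
    return f"{field}:{value}"


def range_value(low="*", high="*"):
    """Return a '[low TO high]' range; '*' leaves that end open."""
    return f"[{low} TO {high}]"


def all_of(*terms):
    """Join terms with the AND operator used in the examples."""
    return " AND ".join(terms)


# Reviewed entries created since 1 October 2012
query = all_of(
    field_query("reviewed", "yes"),
    field_query("created", range_value(20121001)),
)
print(query)
print(field_query("organism", "Ovis aries"))
```

<p>The resulting strings can be pasted into a search box that accepts this syntax; how they are submitted programmatically is outside the scope of this sketch.</p>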
<table border="1" width="95%">
<code>accession:P62988</code>
Lists all entries with the primary or secondary
accession number P62988.
<code>active:no</code>
Lists all obsolete entries.
annotation:(type:non-positional)
annotation:(type:positional)
annotation:(type:mod_res "Pyrrolidone carboxylic acid" evidence:experimental)
Lists all entries with:
<li>any general annotation (comments [CC])</li>
<li>any sequence annotation (features [FT])</li>
<li>at least one amino acid modified with a Pyrrolidone carboxylic acid group</li>
Lists all entries with at least one reference co-authored by Michael Ashburner.
Lists all entries whose cluster of differentiation number is CD233.
citation:("intracellular structural proteins")
citation:(author:ashburner journal:nature)
Lists all entries with a literature citation:
<li>containing the phrase "intracellular structural proteins" in either title or abstract</li>
<li>co-authored by Michael Ashburner and published in Nature</li>
<li>with the PubMed identifier 9169874</li>
cluster:UniRef90_A5YMT3
Lists all entries in the UniRef 90% identity cluster whose
representative sequence is UniProtKB entry A5YMT3.
annotation:(type:transmem count:5)<br />
annotation:(type:transmem count:[5 TO *])<br />
annotation:(type:cofactor count:[3 TO *])
<td>Lists all entries with:
<li>exactly 5 transmembrane regions</li>
<li>5 or more transmembrane regions</li>
<li>3 or more Cofactor comments</li>
created:[20121001 TO *]<br />
reviewed:yes AND created:[current TO *]
Lists all entries created since October 1st 2012.<br />
Lists all new UniProtKB/Swiss-Prot entries in the last release.
database:(type:pdb 1aut)
Lists all entries with:
<li>a cross-reference to the Pfam database</li>
<li>a cross-reference to the PDB database entry 1aut</li>
Lists all entries with a Von Willebrand factor type A domain described
in the 'Family and Domains' section.
Lists all beta-galactosidases.
annotation:(type:signal evidence:ECO_0000269)<br />
annotation:(type:mod_res phosphoserine evidence:ECO_0000269)<br />
annotation:(type:function AND evidence:ECO_0000255)
<td>Lists all entries with:
<li>a signal sequence whose positions have been experimentally proven</li>
<li>experimentally proven phosphoserine sites</li>
<li>a function manually asserted according to rules</li>
Lists all entries belonging to the Serpin family of proteins.
Lists all entries with an incomplete sequence.
Lists all entries for proteins encoded by gene HSPC233.
Lists all entries associated with:
<li>a GO term containing the word "cytoskeleton"</li>
<li>the GO term Actin cytoskeleton and any subclasses</li>
Lists all entries for viruses infecting:
<li>organisms with a name containing the word "mouse"</li>
<li>Mus musculus (Mouse)</li>
<li>all mammals (all taxa classified under the taxonomy node for Mammalia)</li>
<code>id:P00750</code>
Returns the entry with the primary
accession number P00750.
Lists all entries whose "International Nonproprietary Name" is Anakinra.
Lists all entries describing interactions with the protein described by
Lists all entries associated with the keyword Toxin.
Lists all entries describing sequences of length between 500 and 700 residues.
This field is a synonym for the field <code>taxonomy</code>.
Lists all entries describing sequences with a mass of at least 500,000 Da.
Lists all entries for proteins identified by: matrix-assisted laser
desorption/ionization (MALDI), crystallography (X-ray). The
<code>method</code> field searches names of physico-chemical
identification methods in the 'Biophysicochemical properties' subsection of the 'Function' section and in the 'Publications' and
'Cross-references' sections.
Lists all entries with entry name (ID) ATP6_HUMAN. Also searches
obsolete entry names.
modified:[20120101 TO 20120301]<br />
reviewed:yes AND modified:[current TO *]
Lists all entries that were last modified between January and March 2012.<br />
Lists all UniProtKB/Swiss-Prot entries modified in the last release.
Lists all entries for prion proteins.
organelle:Mitochondrion
Lists all entries for proteins encoded by a gene of the mitochondrial
organism:"Ovis aries"
Lists all entries for proteins expressed in sheep (first 2 examples) and
organisms whose name contains the term "sheep".
Lists all entries for proteins encoded by a gene of plasmid ColE1.
Lists all entries from the human proteome.
<td>proteomecomponent</td>
proteomecomponent:"chromosome 1" AND organism:9606
Lists all entries from the human chromosome 1.
Lists all entries that were created from a merge with entry P02023.
Lists all UniProtKB/Swiss-Prot entries.
Lists all entries containing a reference that was used to gather
information about mutagenesis.
Lists all entries containing a link to isoform 9 of the sequence
described in entry P05067. Allows searching by specific sequence
<td>sequence_modified</td>
sequence_modified:[20120101 TO 20120301]<br />
reviewed:yes AND sequence_modified:[current TO *]
Lists all entries whose sequences were last modified between January and March 2012.<br />
Lists all UniProtKB/Swiss-Prot entries whose sequences were modified in the last release.
Lists all entries containing a GO term whose annotation source is the
Lists all entries containing a reference relevant to strain wistar.
Lists all entries for proteins expressed in mammals. This field is used to retrieve
entries for all organisms classified below a given taxonomic node (taxonomy classification).
Lists all entries containing a reference describing the protein sequence
obtained from a clone isolated from liver.
Lists all entries for proteins that are described in Wikipedia.