3 * Jalview - A Sequence Alignment Editor and Viewer ($$Version-Rel$$)
4 * Copyright (C) $$Year-Rel$$ The Jalview Authors
6 * This file is part of Jalview.
8 * Jalview is free software: you can redistribute it and/or
9 * modify it under the terms of the GNU General Public License
10 * as published by the Free Software Foundation, either version 3
11 * of the License, or (at your option) any later version.
13 * Jalview is distributed in the hope that it will be useful, but
14 * WITHOUT ANY WARRANTY; without even the implied warranty
15 * of MERCHANTABILITY or FITNESS FOR A PARTICULAR
16 * PURPOSE. See the GNU General Public License for more details.
18 * You should have received a copy of the GNU General Public License
19 * along with Jalview. If not, see <http://www.gnu.org/licenses/>.
20 * The Jalview Authors are detailed in the 'AUTHORS' file.
23 <title>UniProtKB query fields</title>
28 <strong>UniProtKB query fields</strong>
31 Supported query fields for searching specific data in UniProtKB (see
32 also <a href="uniprotsequencefetcher.html#text-search">query
36 <table border="1" width="95%">
45 <code>accession:P62988</code>
48 Lists all entries with the primary or secondary
49 accession number P62988.
55 <code>active:no </code>
58 Lists all obsolete entries.
65 annotation:(type:non-positional)
67 annotation:(type:positional)
69 annotation:(type:mod_res "Pyrrolidone carboxylic acid" evidence:experimental)
73 Lists all entries with:
75 <li>any general annotation (comments [CC])</li>
76 <li>any sequence annotation (features [FT])</li>
77 <li>at least one amino acid modified with a Pyrrolidone carboxylic acid group</li>
89 Lists all entries with at least one reference co-authored by Michael Ashburner.
100 Lists all entries whose cluster of differentiation number is CD233.
107 citation:("intracellular structural proteins")
109 citation:(author:ashburner journal:nature)
114 Lists all entries with a literature citation:
116 <li>containing the phrase "intracellular structural proteins" in either title or abstract</li>
117 <li>co-authored by Michael Ashburner and published in Nature</li>
118 <li>with the PubMed identifier 9169874</li>
126 cluster:UniRef90_A5YMT3
130 Lists all entries in the UniRef 90% identity cluster whose
131 representative sequence is UniProtKB entry A5YMT3.
138 annotation:(type:transmem count:5)<br />
139 annotation:(type:transmem count:[5 TO *])<br />
140 annotation:(type:cofactor count:[3 TO *])
143 <td>Lists all entries with:
145 <li>exactly 5 transmembrane regions</li>
146 <li>5 or more transmembrane regions</li>
147 <li>3 or more Cofactor comments</li>
155 created:[20121001 TO *]<br />
156 reviewed:yes AND created:[current TO *]
160 Lists all entries created since October 1st 2012.<br />
161 Lists all new UniProtKB/Swiss-Prot entries in the last release.
170 database:(type:pdb 1aut)
174 Lists all entries with:
176 <li>a cross-reference to the Pfam database</li>
177 <li>a cross-reference to the PDB database entry 1aut</li>
190 Lists all entries with a Von Willebrand factor type A domain described
191 in the 'Family and Domains' section.
202 Lists all beta-galactosidases.
209 annotation:(type:signal evidence:ECO_0000269)<br />
210 (type:mod_res phosphoserine evidence:ECO_0000269)<br />
211 annotation:(type:function AND evidence:ECO_0000255)
214 <td>Lists all entries with:
216 <li>a signal sequence whose positions have been experimentally proven</li>
217 <li>experimentally proven phosphoserine sites</li>
218 <li>a function manually asserted according to rules</li>
230 Lists all entries belonging to the Serpin family of proteins.
241 Lists all entries with an incomplete sequence.
253 Lists all entries for proteins encoded by gene HSPC233.
266 Lists all entries associated with:
268 <li>a GO term containing the word "cytoskeleton"</li>
269 <li>the GO term Actin cytoskeleton and any subclasses</li>
285 Lists all entries for viruses infecting:
287 <li>organisms with a name containing the word "mouse"</li>
288 <li>Mus musculus (Mouse)</li>
289 <li>all mammals (all taxa classified under the taxonomy node for Mammalia)</li>
296 <code>id:P00750</code>
299 Returns the entry with the primary
300 accession number P00750.
311 Lists all entries whose "International Nonproprietary Name" is Anakinra.
322 Lists all entries describing interactions with the protein described by
334 Lists all entries associated with the keyword Toxin.
345 Lists all entries describing sequences of length between 500 and 700 residues.
352 This field is a synonym for the field <code>taxonomy</code>.
363 Lists all entries describing sequences with a mass of at least 500,000 Da.
376 Lists all entries for proteins identified by: matrix-assisted laser
377 desorption/ionization (MALDI), crystallography (X-Ray). The
378 <code>method</code> field searches names of physico-chemical
379 identification methods in the 'Biophysicochemical properties' subsection of the 'Function' section, the 'Publications' and
380 'Cross-references' sections.
391 Lists all entries with entry name (ID) ATP6_HUMAN. Searches also
392 obsolete entry names.
399 modified:[20120101 TO 20120301]<br />
400 reviewed:yes AND modified:[current TO *]
404 Lists all entries that were last modified between January and March 2012.<br />
405 Lists all UniProtKB/Swiss-Prot entries modified in the last release.
416 Lists all entries for prion proteins.
423 organelle:Mitochondrion
427 Lists all entries for proteins encoded by a gene of the mitochondrial
435 organism:"Ovis aries"
444 Lists all entries for proteins expressed in sheep (first 2 examples) and
445 organisms whose name contains the term "sheep".
457 Lists all entries for proteins encoded by a gene of plasmid ColE1.
468 Lists all entries from the human proteome.
472 <td>proteomecomponent</td>
475 proteomecomponent:"chromosome 1" and organism:9606
479 Lists all entries from the human chromosome 1.
490 Lists all entries that were created from a merge with entry P02023.
501 Lists all UniProtKB/Swiss-Prot entries.
512 Lists all entries containing a reference that was used to gather
513 information about mutagenesis.
524 Lists all entries containing a link to isoform 9 of the sequence
525 described in entry P05067. Allows searching by specific sequence
530 <td>sequence_modified</td>
533 sequence_modified:[20120101 TO 20120301]<br />
534 reviewed:yes AND sequence_modified:[current TO *]
538 Lists all entries whose sequences were last modified between January and March 2012.<br />
539 Lists all UniProtKB/Swiss-Prot entries whose sequences were modified in the last release.
550 Lists all entries containing a GO term whose annotation source is the
562 Lists all entries containing a reference relevant to strain wistar.
573 Lists all entries for proteins expressed in Mammals. This field is used to retrieve
574 entries for all organisms classified below a given taxonomic node taxonomy classification).
585 Lists all entries containing a reference describing the protein sequence
586 obtained from a clone isolated from liver.
597 Lists all entries for proteins that are described in Wikipedia.