I want to use your co-evolution website to find my co-evolution amino
acid in my interested protein. I found my proteins family in different
species by blast. but when i uploaded it to the server. server sent me an
email that told me your sequences are not enough.i loaded an example in your
server to found out how it worked but it had a problem too and error
message sent to me.please help me to how to use this server.
We set the minimum number of sequences as 50 to ensure statistical significance
of the results. If you really want to run with fewer sequences, you can change
the number in the advanced options.
As for the other error, please send me the error message if you still cannot run
I have conducted structure based statistical coupling analyses (SCA) on each
of some mitochondrial proteins using 800 multiple sequences (including one
sequence from our organisms, one 3RKO structure sequence, and 788 protein
sequences from different genera), and we could obtain the coevolutionary
scores and spatial distances between any pair of two residues. The aim of
our study is try to analyze the coevolutionary role of some important given
residues (selected by PAML analyses) on key or important residues
responsible for proton translocation in the proton translocating channel of
respiratory Complex I. The problem is we are not sure how to do it in a more
statistical way. Such as, we could have the data of scores and distances of
a given selected residue on these residues in proton channel or other
residues of the same protein. In order to know possible different
coevolutionary role of a given residue i.e. the selective residue on proton
channel residues or other residues, t-test on scores (s), or distances (d)
or sores/distinces (s/d) were compared by us between those types of
residues, we are not sure if this kind of analyse is ok for us. Such as we
don’t know whether the score obtained by SCA analyses in the platform has
alreadly considered the potential role of distance, or it is just the score
obtained no mattter where both residues are? We know the influencing role
between any two given residues might be correlated with both their
characteristics and spatial distance between them.
Do you have any good idea on this, or do you have more reasonable
statistical way to solve our queries and prolem above?
The scores were calculated based on the MSA alone without
considering the spatial distance between residues.
You may want to plot the global distribution of scores, and look
for scores that are significantly larger than the rest but cannot be
explained by the distance on the primary sequence alone. Indirect
coupling between residues though other residues is also something to be
aware of. There have been a lot of new papers about co-evolutionary
analysis lately (e.g., from Rama Ranganathan’s and Debora Marks’s labs).
I am trying to submit an MSA to your "coevolution" webpage — and it keeps failing on me.
i keep getting an error email saying "could not be completed" Error message:Not enough sequences.
im sure that my MSA is in fasta format… so not really sure what is going wrong…
if you could help me out, it would be much appreciated.
The tool performs a number of filtering steps, to ensure the
reliability of the results. The error messages states that after
filtering, the number of remaining sequences is too small for a
reliable analysis. You may change the filtering criteria using the
advanced options (hidden by default), but please notice that by doing
so you may get results that are unreliable.
The seed alignment of PF01036 has been changed recently.
BACR_HALSA is no longer one of the seed sequences. I have just changed
the example to use BACR_HALAR instead, and the program ran fine.
Please let me know if you encounter any problem running the program on
your own data.
I’ve been trying to use your server, but evidently it is not working
correctly; your example data even fails to process.
I am eager to use this server (and likewise cite you), so please let
me know at your earliest convenience if/how I can use this server.
I have been using the co-evolution server and have a few queries regarding the analysis of the results. I would like to know the significance of the Mean, PMean values etc in the analysis document.A link to any publication regarding the same would be very helpful.
Our publication is at http://papers.gersteinlab.org/papers/coevolution
I have read the article “An integrated system for studying residue coevolution in proteins” and start to use the coevolution webserver http://coevolution.gersteinlab.org/coevolution/. The help page says SCA is one of the coevolution score function, which I can‘t find on the webpage. Could you please tell me what is wrong?
SCA is only available in the version for download, but not at the Web site anymore since it sometimes ran for an extremely long time. If you want to use the SCA method, you can use the download version or the latest SCA method by the Ranganathan group, which I think is better than the version we implemented.
We are interested in
adding a link to your tool on the website. I have been playing around with
the tool but have been having some difficulty. I loaded the example, ran the
analysis and downloaded the results with no problem. Then, I tried to use
the data from the example to re-run the analysis by uploading the PF01036
fasta file (as an MSA) and listing BACR_HALSA as the reference sequence. I
did not load the tree data or the structure data. I submitted the request
and received the following error:
The coevolution analysis task that you submitted at 2013-09-11 11:35:07.0
could not be completed.
Error message:Not enough sequences.
In order to justify the addition of the link to the website, we need to make
sure that the web interface is simple and easy to use. Can you help me
understand what the problem might be (clearly I have enough sequences since
the example runs without any issues)? Also, can you explain to me the link
between the 1C3W reference structure and the sequence data?
Any help you can give would be greatly appreciated.
The error message is due to an internal check of the system. Since
coevolution analysis requires a good number of sequences to give
reliable statistics, there is a minimum threshold of the number of
sequences after the filtering step. When the example is loaded, I think
some of the filtering settings are made so that it can pass the minimum
threshold. When one manually uploads the MSA, the default settings could
You may try changing the setting for the minimum number of
sequences in the "Advanced options" section (which is by default hidden)
and rerun. Let me know if you still get any error message.