I have downloaded source code for 97 files using:
then run a rename loop:
to keep the html tags but make the file a text that can be parsed.
In each of the 97 txt files the gene # is variable, but the gene is associated or should have a corresponding OMIM #. They are all in,
Is there a way to search the source code for these gene names and OMIM #’s?
For example, in the attached file there are 26 genes:
Output (tab-delimited)
The gene names seem to be after
and the OMIM # seem to be
I think
can parse html but I am not familiar enough to know how to code it for multiple files in a directory.
Thank you to all for the help :).
Code:
wget -x -i link.txt
Code:
for file in *
do
mv $file $file.txt
done
In each of the 97 txt files the gene # is variable, but the gene is associated or should have a corresponding OMIM #. They are all in,
Code:
C:\Users\cmccabe\Desktop\list\geneticslab.emory.edu.txt\tests_txt
For example, in the attached file there are 26 genes:
Output (tab-delimited)
Code:
A B
Gene OMIM
AKT1 164730
ALK 105590
APC 611731
Code:
target = '_blank'>AKT1</a>
Code:
style = 'margin-bottom:10px;'><a href =
Code:
sed
Thank you to all for the help :).