Bioinformatics-with-teja: perl problem -4 (easy)

Monday, June 8, 2009

hi everyone,

Write a program that scans a given DNA string for incorrect nucleotides (i.e other than A,G,C,T) and replace them with a *

Example input

AGUCGACFTGCTCGATC

Example output

AG*CGAC*TGCTCGATC

Amit BhosaleJune 9, 2009 at 6:01 PM
Isn't U one of the nucleotide bases? When translated T becomes U right? Correct me if I am wrong but m-RNA sequences will contain U, and so a * must be placed at places other than A,T, C, G and U. But if we are looking at an m-RNA sequence then, there must be no T's. So how do you propose to approach this problem? This can definitely be applied to areas of data curation in new databases.
ReplyDelete
Replies
tejaJune 9, 2009 at 8:17 PM
Thats why I have specified 'DNA' string. DNA when transcribed into mRNA have 'U'.(T in DNA will be converted to U in mRNA).Translation is a different process where mRNA is converted to protein.DNA strings wont contain 'U'.They will contain only AGCT . mRNA will contain AGCU. So u have to place a * at U also.
ReplyDelete
Replies
tejaJune 12, 2009 at 11:41 AM
HERE IS A SOLUTION TO PERL PROBLEM 4 BY AMIT

my $input="AGUCGACFTGCTCGATC";
my @temp=split("",$input);
my $output="";

foreach my $char ( @temp ) {
if ( $char =~ /[AGCT]/ ) {
$output .= $char;
}
else
{
$output .= "*";
}

}

print "\n Output : $output \n";
ReplyDelete
Replies
UnknownJune 14, 2009 at 6:25 AM
chomp($input = <>);
$input =~ s/[^AGTC]/\*/g;
print "$input\n";
ReplyDelete
Replies

Bioinformatics-with-teja