Probe-mining of endo-1,4-beta-xylanase from goats-rumen bacterial metagenomic DNA data
Endo-1,4-beta-xylanases (xylanases) are classified into 9 glycoside hydrolase families, GH5, 8, 10, 11, 30, 43, 51, 98, and 141 based on the CAZy database. The probe sequences representing the enzymes were constructed from published sequences of actual experimental studies with xylan decomposition activity. From online databases, we found one sequence belonging to the GH5 family, 6 sequences belonging to the GH8 family and 5 sequences belonging to the GH30 family exhibiting xylanase activity. Thus specific probes for xylanase GH8 and GH30 families were designed with the length of 351 and 425 amino acids respectively. The reference values for the probe of the GH8 family were defined as the sequences with maximum score greater than 168, the lowest coverage was 84%, the lowest similarity was 36%; for the probe GH30, the maximum score was greater than 316, the coverage was greater than 98%, the similarity was greater than 41%. Using the built probes, including the probe of the two GH10 and GH11 families, we found 41 xylanase-encoding sequences from the metagenomic DNA data of bacteria in Vietnamese goats’rumen. Of the 41 exploited sequences, 19 were identical to the BGI company's annotation result based on KEGG database, whereas there were 16 sequences that are not annotated by the BGI company. Total 28 of 41 exploited sequences were complete open reading frames, of which the predicted ternary structure was highly similar to the published structures of xylanase.