Ex. Principal Scientist & Coordinator, Biotechnology Information Centre on Aquaculture of ICAR-CIFA Bhubaneswar), Prof. Institute of genetic Engineering, Badu, Kolkata, India
*Address for Correspondence: Ajit Kumar Roy, (Ex. Principal Scientist & Coordinator, Biotechnology Information Centre on Aquaculture of ICAR-CIFA Bhubaneswar), Prof. Institute of genetic Engineering, Badu, Kolkata, India, Tel: 09230011361; Email: akroy1946@yahoo.co.in
Dates: Submitted: 05 August 2017; Approved: 25 August 2017; Published: 28 August 2017
How to cite this article: Roy AK. Microarray Analysis of Fish Genomic Data for enhancing Aquaculture Productivity of India. Ann Proteom Bioinform. 2017; 1: 006-017. DOI: 10.29328/journal.apb.1001002
Copyright License: © 2017 Roy AK. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Keywords: Microarray; Fish genome; Aquaculture; Indian major carps; Salmon; Zebrafish
This review gives a brief introduction to the microarray technology and its experimental design and data analysis and a discussion of recent global progress in research using microarray technology in fish biology and aquaculture. DNA microarrays have been reported to have been used for the analysis of gene expression during various physiological, developmental or cellular processes in fish. During the recent past, investigators have begun to use microarrays on fish to address ecological, evolutionary and environmental questions including the variability of gene expression in natural populations, speciation, ecotype diversity, environmental remediation and host-pathogen interactions. The study suggests that a lot of gene expression studies have been conducted on salmon and zebrafish in Europe and USA. The same may be applied on Indian Major Carps and Catfishes to augment productivity from aquaculture sector.
A microarray is a collection of microscopic DNA spots attached to a solid surface. The DNA microarray is a tool used to determine whether the DNA from a particular individual contains a mutation in genes like BRCA1 and BRCA2.Now a days we use DNA microarrays to measure the expression levels of large numbers of genes simultaneously or to genotype multiple regions of a genome. Microarray technology has been applied to study of gene expression to study mechanisms of diseases and to accelerate the drug discovery process. There is a definite trend towards increasing the use of molecular diagnostic methods, and biochip technologies, along with bioinformatics techniques. Gene expression microarrays are tools that tell how much RNA (if any) a gene is making. Since 1977, and prior to microarray, only a few genes could be studied at a time using the northern blot analysis. Gene Chip microarrays use the natural chemical attraction, or hybridization, between DNA on the array and RNA target molecule from the sample based on complementary base pairs. Only RNA target molecule that have exact complementary base pair bind to the prob. Gene expression detection microarray is that they are able to measure tens of thousands of genes at a time, and it is this quantitative change in the scale of gene measurement that has led to a qualitative change in our ability to understand regulatory processes that occur at the cellular level. In other words, the short probe on the microarray measures the expression of the complete gene by sampling only a small section of the gene. In some instances, as little as one RNA molecule out of 100,000 different RNAs in an original sample may be detected. Microarray is one such technology is widely used by the researchers to investigate and address issues which were once thought to be nontraceable DNA microarrays are becoming a common tool in many areas of microbial research, including microbial physiology, pathogenesis, epidemiology, ecology, phylogeny, and pathway engineering. One can analyze the expression of many genes in a single reaction quickly and in an efficient manner. DNA Microarray technology helps in the identification of new genes and in knowing about their functioning and expression levels under different conditions, comparative genomics, and SNPs identification [1-6]. Proteome chips have been used to study protein-protein interactions [7], protein-DNA interactions [8], protein-lipid interactions [7], protein-drug interactions [9], protein-receptor interactions [10], and antigen-antibody interactions [11]. In addition, proteome chips have been used to study kinase activities [7,12] and have been used for serum profiling [13]. This chapter has focused on use of microarray technology in understanding biological process, and its subsequent applications in Fish biology aiming to enhance aquaculture productivity.
Genomic technology and post-genome technology
The advent of micro-array technology revolutionized biomedical research, for the first time allowing overall characterization (genome-wide) of gene expression. Yet the technology had several flaws limiting sensitivity and specificity that are overcome by sequencing. A first application of NGS was in gene expression profiling. Hybridization on DNA microarrays is a proven technique in genomics research because of its. Flexibility and low cost of production. Microarrays with low-density probes have been used in single-nucleotide polymorphism detection as well as nucleic acid diagnostic application. The impact of microarrays on the development of these new tools have started modifying our daily practice of experimental
How does a DNA microarray work?
Figure 1 shows research in biology. Application of new technology like DNA and protein microarrays necessitates input from molecular biologists, geneticists, chemists, physicists, mathematicians, computer scientists, and engineers. Protein based microarrays are key to this goal, permitting both the characterization of disease systems and the system-wide pharmacophoric properties of putative therapeutic agents. Many cutting-edge microarray analysis tools and algorithms, including commonly used limma and affy packages in Bioconductor, need sophisticated knowledge of mathematics, statistics and computer skills for implementation. Commercially available software can provide a user-friendly interface at considerable cost. To facilitate the use of these tools for microarray data analysis on an open platform is available that is developed an online microarray data analysis platform, WebArray, for bench biologists to utilize these tools to explore data from single/dual color microarray experiments. The currently implemented functions were based on limma and affy package from Bioconductor, the spacings LOESS histogram (SPLOSH) method, PCA-assisted normalization method and genome mapping method. Web Array incorporates these packages and provides a user-friendly interface for accessing a wide range of key functions of limma and others, such as spot quality weight, background correction, graphical plotting, normalization, linear modeling, empirical Bayes statistical analysis, false discovery rate (FDR) estimation, chromosomal mapping for genome comparison. Web Array offers a convenient platform for bench biologists to access several cutting-edge microarray data analysis tools. The website is freely available at http://bioinformatics.skcc.org/webarray [14].
Application of the microarray technology in the studies of Fish Biology
If aquaculture is to play a vital role in ensuring future fish availability for food security and nutrition, this sector has to adopt best practice in aquaculture that requires a full understanding of the genomic controls and transcriptional profiles of cultured fish species as well as improvements in aquaculture through regulation of the expression of functional genes. In the post genomic era, the use of DNA microarray technology in fish biology and aquaculture may have great significance and may be applied to discovery of novel genes, gene expression profiling from fish species of interest, and identification of the genomic responses to environmental stimulation in aquaculture. The use of microarrays for the study of various aspects of fish physiology has seen a spectacular increase in recent years from model species, such as zebra fish, to current studies with commercially important species, such as salmonids, catfish, carp, and flatfish. Besides micro array technology has emerged as a key tool for understanding developmental processes as well as basic physiology. Recent development is the application of micro arrays technology in the fields of ecotoxicology and nutrigenomics. Microarray and gene expression analysis have been key in our understanding of molecular pathways underlying physiological responses. DNA microarrays have been used for the analysis of gene expression during various physiological, developmental or cellular processes in fish. During the recent past, investigators have begun to use microarrays on fish to address ecological, evolutionary and environmental questions including the variability of gene expression in natural populations, speciation, ecotype diversity, environmental remediation and host-pathogen interactions. Given their commercial importance and the availability of several microarray platforms, most of the research in these areas has focused on salmonids. However, the same issues could be studied in a number of fish species like Indian Major Carps, catfish, prawn and minor carps of commercial interest and nutritional values.
As established by quantitative RT-qPCR before the advent of microarrays, 4 to 8 days after DNA vaccination by intramuscular injection, gene expression by fish hematopoietic organs showed an increase in interferon-inducible mx [15-20], virally induced genes (Vig) [21,22] and mhc and tcr genes [23]. The recent availability of fish microarrays [24], which allow the expression profiling of thousands of genes simultaneously, has provided new opportunities to further study fish immunological responses in several rhabdovirus/fish models. Traditional sequencing, annotation and estimation of frequencies of each rhabdovirallyinduced transcript in flatfish, is one of the strategies designed to identify genes transcribed after rhabdoviral infections (pathogen-induced gene approach) [25]. To identify HRV-induced genes were made in the Japanese flounder Paralichthys olivaceous by sequencing 300-596 expressed sequence tag (EST) clones from leukocytes 2-5 days after infection. The frequencies of each EST were estimated within a short 1 to 10 range [26,27]. Japanese flounder EST-derived cDNA microarrays were applied to in vitro kidney cell cultures 3-6 h after HRV infection [28].
Recently, many studies have used microarray technology to determine gene profiles in different fish species. A microarray studies on various species of fishes have been reported [29-37] Literature review suggests that in India few studied have been conducted on Indian major carps [38-40]. The use of DNA microarray technology in fish biology and aquaculture may have great significance and may be applied to discovery of novel genes, gene expression profiling from fish Species of interest, and identification of the genomic responses to environmental stimulation in aquaculture.
Use of the microarray technology in the studies of model Fish, specifically on Zebra Fish and Salmon Fish
Microarrays have already been used to quantify fish gene expression as well as to discover new genes involved in defense in several fish rhabdovirus models, such as flatfish, salmonid (salmon and trout) and zebrafish. Genes that show increased transcription after infection (hypothetically signaling internal organs to react against the viral invasion) and genes whose transcription is inhibited (possibly due to viral shut-off of critical host defenses) might help researchers in their quest to identify new adjuvant candidates for fish vaccines. Few studies have used microarray technology to determine gene profiles in different fish species [5]. A microarray consisting of 4512 complementary DNAs (cDNAs) was constructed to investigate the adaptive molecular responses of zebra fish to hypoxia during development [6,7]. Linney et al. [8], analyzed 15512 unique transcripts from wild Danio rerio and found that 23 muscle specific genes expression data is elucidated, however, it is much analyzed 15512 unique transcripts from wild-type challenging.
Expressed sequence tag (EST)-based microarrays of the Japanese flounder, trout, salmon and zebrafish have been used in gene-discovery efforts. These studies included infections with IHNV [41,42], VHSV [43-45] and hirame rhabdovirus (HRV) [46].The expression of a number of unknown genes was also increased [25]. Among these, the LB3 [8] gene increased a maximum of 56-fold 3 days after infection and then remained increased for one week [43]. Microarrays in the study of the salmonid/IHNV/VHSV models Large-scale genomic projects for salmon have been initiated by groups in Canada, the USA, the UK, Norway and France. As a result there are many physical and genetic maps, large collections of ESTs and a growing number of genomic sequences and derived microarrays. Thus three projects have developed salmonid microarrays. The first salmonid 16K cDNA microarray appeared in 2004. This array was developed by the Genomic Research on Atlantic Salmon Project (GRASP) [47] and led to the most recent 32K cDNA [48,49] and the first 5K oligo DNA of 70-mer [50] microarrays. The high sequence similarity (~86 %) between salmonids (9 genera and 68 species) indicates that cDNA microarrays may be suitable for studies involving any member of this fish family. Transcriptome Analysis of Important Traits of Salmon (TRAITS) and the Norwegian Salmon Genome Project (SGP) also developed a 16K cDNA microarray (http://www.abdn.sfirc/salmon) based on two independent collections of their bacterial clones kept in ARK, Genomics Facility at Roslin Institute, UK and at SGP Genetics Laboratory at the University of Oslo, respectively. The TRAITS-SGP cDNA array was obtained from ESTs from 15 tissues (pathogen-induced libraries, trait-specific subtractive EST, starvation-induced libraries, diet-response libraries, smoltification-response libraries and well-known genes). This array was conceived as a preliminary tool to develop an oligo microarray for routine health monitoring of Atlantic salmon. The first results found some artefactual expression patterns caused by cross-hybridization of similar transcripts and underlined the greater relevance of biological over technical replicates [51].
The new rainbow trout (Oncorhynchus mykiss) high-density, oligonucleotide microarray was developed using 37394 specific 60-mer oligonucleotide probes assembled from 244984 ESTs from 12 tissues (http://compbio.dfci.harvard.edu/tgi/cgibin/tgi/gimain.pl?gudb¼r_trout). The specificity of each probe was checked for possible non-specific mRNA cross-hybridization by comparing all individual probes with all rainbow trout transcriptome sequences. Approximately 91 % of the sequences used for this microarray matched a previously annotated sequence in the GenBank. Few attempts have been made to use these microarrays to study the rhabdoviral immunization of salmonids. In homozygous trout, the 16 K cDNA GRASP microarray was used to profile 7-day muscle transcripts after intramuscular injection of the IHNV G gene [47,49]. After immunization, irf3, mx, vig1, and vig8 transcripts were increased [42]. Genes associated with antigen presenting cells, lymphocytes, leukocytes, inflammation, antigen presentation, and interferon pathways were also augmented. The increased levels of transcripts associated with type I IFN pathways in systemic organs (gill, spleen and kidney) were corroborated by RT-qPCR. These observations confirmed that, when intramuscularly injected, the host expressed viral G gene induces a systemic non-specific type 1 IFN innate immune response. Using a 1.8K cDNA salmonid microarray, comparison of infection-by-injection with IHNV and attenuated IHNV in rainbow trout after 1 and 3 days showed an IHNV-dependent change in differential transcription in kidney towards adaptive immunity genes. Thus, the rapid spread of the IHNV infection inhibited tnfa, mhc1, and several other gene markers while favouring mhc2 and ig responses. The molecular mechanism for the development of late (months) specific cytotoxic T or B cell-mediated humoral responses has not been addressed by means of microarrays [52,53]. More recently, trout families with low (32% survival following challenge) and high susceptibility to VHSV (18% survival following challenge) were infected with VHSV by bath exposure and transcriptional data from internal organs were analyzed with the 16K GRASP microarray from day 3 post-challenge [54]. In total, 939 genes were differentially expressed between infected and non-infected fish. The genes increased in infected fish belonged to the following categories: stress and defense response, NFkappaB signal transduction, response to non-self, antigen processing and presentation, and proteasome complexes. Most were also increased among the 642 differentially expressed genes in the low-susceptibility trout family but not among the 556 differentially expressed genes in the high-susceptibility family. These results suggest that the innate immune system of internal organs plays a crucial role in eliciting an effective immune response to VHSV infection in rainbow trout [54].
The zebrafish Danio rerio is one of the most suitable models in which to carry out microarray studies because, compared to other fish, its genome sequence is one of the most advanced. Furthermore, ~ 40 K annotated quantitative polymerase chain reaction (qPCR) arrays and annotated oligo microarrays are available. In addition, large-scale experimentation with zebrafish is easier than with other fish models and zebrafish are susceptible to several viruses, most of these belonging to the fish rhabdoviral family [55]. Of these, VHSV [56] was chosen in a recent study using microarrays [57] over IHNV [58], snake-head rhabdovirus (SHRV) [59] and spring viremia of carp (SVC) [60], because only in the VHSV/zebrafish model have infection-by-immersion (the natural route of infection) and successful vaccination been described [56]. Damage and epithelial cell death immediately after VHSV infection in the surface portals of entry of these viruses, such as the fins [61], should alert surrounding cells to promote epithelial cell division to replace dead cells, recruit inflammatory cells to the infection site, and send signals to internal immune organs. However, viral-induced signals to inhibit the most relevant host responses have also been detected. Detection of natural early responses may contribute to identifying vaccine adjuvants. Thus, the expression of the 636 immune-related transcripts that were increased after VHSV infection, as estimated by hybridization to oligo microarrays (confirmed by RT-qPCR arrays), was higher in fins than in organs. In contrast, the number of decreased transcripts was higher in organs than in fins. Therefore, an upregulated response of immune-related genes was greatest in fin tissues, while a downregulated response was most detected in the internal organ responses. The latter might be targets of viral inhibitory signals early after infection [57]. These results showed that 2 days after infection-by-immersion, VHSV had not yet caused a strong response from zebrafish internal organs, which contrasts with reports in other fish at later times after infection-by-injection (such as ifn1, mx, il1b, tnfa, etc.) [62-65] or infection-by-immersion [54,66]. The zebrafish are refractory to rhabdoviral infection-by-immersion at high temperatures or without acclimatization to low temperatures with IHNV [58], VHSV [56] or SVC [60]. So, a temperature-dependent response mechanism(s) that inhibits rhabdoviral infection and spread may occur. While these preliminary findings shed some light on the earliest effects of VHSV infection at the molecular level, some of the new immune-related genes identified might be suitable candidate adjuvants for fish vaccines [67-69]. To best detect innate immune responses, early times after rhabdoviral infection should be studied. Thus, according to the data obtained from flatfish, salmonid and zebrafish studies, the maximal number of >2-fold differentially expressed genes in microarrays was detected 2-3 days after rhabdoviral infection [26,43,44, 6,18,27]. As with many other viruses and host species, an increase in ifn1 expression is one of the first responses to the injection of any DNA vaccine [15] and to rhabdoviral infections [70]. Transcripts encoding several forms of il17 were detected as differentially expressed only in one of the studies using microarrays [57].
Development of a 37-K high-density oligonucleotide microarray
A new tool for functional genome research in rainbow trout: A rainbow trout high-density oligonucleotide microarray was constructed using all tentative consensus (TC) sequences that are publicly available from all international rainbow trout Oncorhynchus my kiss genomic research projects through the Rainbow Trout Gene Index database. The new array contains 60-mer oligonucleotide probes representing 37 394 unique TC sequences and 1417 control spots. The array (4×44 format) was manufactured according to the design by Agilent Technologies using the inkjet-based Sure Print technology (design number 016320). The performance of the new microarray platform was evaluated by analyzing gene expression associated with rainbow trout, Vitello genesis-induced muscle atrophy. This microarray will open new avenues of research that will aid in the development of novel strategies for genetic improvement for economically important traits benefiting the salmonid aquaculture industries [72].
Expansion of the Genomics Research on Atlantic salmon Salmo Salar L. Project (GRASP) microarray tools
Salmonids are the most widely studied group of fish, and in the last few years, genomics technologies have begun to contribute to this rich biology. The first salmonid microarrays appeared in 2004 and since then several dozen studies have demonstrated the utility of genomic approaches. The widespread use of the genomics research on Atlantic salmon project 16 k array and greatly expanded genome resources have led to the development of an experimental 5 k oligo (70-mer) array and a 32 k cDNA microarray in the near future. In this paper, the authors examined some of the procedures used in the development of past arrays and reexamined them in light of new genomic data available. Some preliminary control experiments of the new 5 k array were investigated that examine oligo designs based on distance from the polyA tail, the effects of mismatches and cross-species hybridization specificity. Beneficial approaches are also identified in the development of the new 32 k cDNA array [47], and cDNA microarray studies on Atlantic salmon [51] and on Cyprinus carpio [73]. Hepatic gene expression profiles in juvenile rainbow trout (Oncorhynchus my kiss) fed fishmeal or fish oil-free diets is reported [74]. It is observed that seventy-one and seventy-five genes were affected after fish oil and fishmeal replacement respectively. The major part of modified gene expression coding for proteins of the metabolic pathways was as follows: (i) a lower level of expression for genes of energy metabolism found in fish after fishmeal and fish oil replacement; (ii) a lower level of gene expression for fatty acid metabolism (biosynthesis) in fish fed with vegetable oils; (iii) a differential expression of actors of detoxification metabolism in trout fed with vegetable oils; (iv) a lower level of expression of genes involved in protein metabolism in fish fed with plant proteins. Overall, our data suggest that dietary fish oil replacement is linked to a decreased capacity of fatty acid biosynthesis (fatty acid syntheses) and variation of detoxification metabolism (cytochrome P450s) whereas dietary fishmeal replacement may depress protein metabolism in the liver as reflected by glutamine synthesize.
Use of microarray technology in the studies of fish reproduction
Several articles are available on Steroid-induced sex reversal in rainbow trout is reported [75]. Intra-gonadal molecular mechanisms involved in the regulation of rainbow trout spermatogenesis is studied [76]; Hormonal induction or photoperiod manipulation of ovulation result in differences in the egg maternal mRNA abundance of specific genes-Prohibitin mRNA accumulation is negatively correlated to embryo developmental potential-The hormonal induction and the photoperiod manipulation of the ovulation affect egg quality through common and distinct mechanisms [78,79]; Molecular phenotype of zebrafish ovarian follicle by serial analysis of gene expression and proteomic profiling, and comparison with the transcriptomes of other animals [80]; A comprehensive survey of the genes involved in maturation and development of the rainbow trout ovary [81] and Sensitivity issues in DNA array-based expression measurements and performance of nylon microarrays for small samples [82]. Identification of candidate’s genes involved in oocyte maturation and/or ovulation [83]; Identification of proteins in full grown oocytes [77] Identify new proteins that may have an extra-ovarian origin-Identify new proteins may have been accumulated much before gene expression ends and identification of proteins in body fluids -search for protein markers of post-ovulatory egg ageing in trout coelomic fluid [84]. Use of a generic cDNA microarray to study rainbow trout reproductive issues and Sex ratio control-natural sex differentiation and steroid induced sex reversal [75]; Control of male sexual maturation-spermatogenesis regulation by gonadotropins and sex steroid hormones [76]. Hormonal or photoperiod control of ovulation affect egg quality ([77] maternal inherited RNA stocks in full grown egg of zebrafish [79]; Oogenesis and egg quality-Follicular competence to ovulation -Egg quality after natural or controlled ovulation [79]).
The study indicates that a lot of gene expression studies have been conducted on salmon and zebrafish in Europe and USA. The same may be applied on Indian Major Carps and Catfishes to augment productivity from aquaculture sector it is expected that abovementioned compilation/findings will be helpful for future researchers in perusing studies on species of Indian origin.
The author is thankful to the unanimous referee for suggestions in improving the manuscript.