Codondex: transcript

Monday, March 4, 2024

p53 Direct Mechanisms In Immunity

Never in the field of molecular oncology have so many sites of posttranslational modification in one protein (p53) been modified by so many different enzymes, but direct response mechanisms that increase immune receptors are rarely discovered and have important implications.

In the tumor microenvironment (TME), cancer-associated fibroblasts (CAFs) display an activated phenotype and can physically remodel the extracellular matrix (ECM). Silencing p53 in the CAFs strongly compromised this activity, implicating p53 as a key contributor to a distinctive CAF feature. Here, the non-autonomous, tumor-suppressive activity of non-mutant p53 cDNA is rewired to become a significant contributor to the CAFs’ tumor-supportive activities. This surprising role for p53 in CAFs suggests that, during tumor progression, p53 functionality is altered not only in the cancer cells but also in their adjacent stroma.

Although p53 is not mutated in the human placenta, it has become functionally incompetent. Why and how p53 is functionally incompetent in cytotrophoblast cells might well be the key to understanding trophoblast invasion. Vascular remodeling for placentation is controlled by small populations of conventional Natural Killer cells, distinct from much larger populations of uterine NK cells. These cells acidify the ECM with a2V-ATPase, which activates MMP9, degrades the ECM and releases stored pro-angiogenic growth factors. Similarly, a hypoxic TME that sustains excessive mitochondrial fission and fragmentation in NK cells could cause a2V-ATPase-activated MMP9 to degrade the ECM and promote angiogenesis in the early TME.

Another MMP protein, MMP2, is a ligand for the Toll-like receptor 2 (Tlr2). Expression of Tlr2 and Tlr4 in the TME is important for the promotion of tumor growth, and when both of these receptors are absent, growth is compromised. Furthermore, the expression of Tlr2 and Tlr4 in both hematopoietic and stromal compartments appears to support MMP2-driven tumor growth.

The integration of the TLR gene family into the p53 regulatory network is unique to primates. p53 promoter response elements that are targeted by this DNA damage and stress-responsive regulator suggest a general p53 role in the control of human TLR gene expression. TLR genes show responses to DNA damage, and most are p53-mediated. TLRs mediate innate immunity to a wide variety of threats through recognition of conserved pathogen-associated molecular motifs. Expression of all TLR genes in blood lymphocytes and alveolar macrophages from healthy volunteers can be induced by DNA metabolic stressors, with considerable inter-individual variability. Most TLR genes respond to p53 via canonical as well as noncanonical promoter binding sites.

A polymorphism in a TLR8 response element provided the first human example of a p53 target sequence specifically responsible for endogenous gene induction. These findings, which demonstrate that the human innate immune system, including downstream induction of cytokines, can be modulated by DNA metabolic stress, have many implications for health and disease, as well as for understanding the evolution of DNA damage and p53 responsive networks. That p53 can directly increase an inflammatory response differs from the generally held view of the antagonistic effect of p53 on inflammation directed by NF-κB. However, the direct mechanism here is different in that it involves another p53-mediated increase in a receptor that translates ligand interactions into cytokine responses.

Wednesday, September 27, 2023

When Immunity Fails Programmed Cell Death

DNA Damage Response

Telomeric repeat (TR) sequences are responsible for genome integrity, where instability is a primary factor that leads to activation of p53. Introduction of a TR into cells leads to stabilization of p53, specific to TRs and not observed in plasmids containing non-TR sequences. TR-activated p53 exhibited enhanced transcriptional activity and induced p53-dependent growth suppression, measured as a reduction in colony formation. Sub-telomeric p53 binding prevents accumulation of DNA damage at human telomeres.

Healthy cells experience thousands of DNA lesions per day. Micronuclei, which contain broken fragments of DNA or chromosomes that have become isolated, are recognized as one mediator of DNA damage response (DDR)-associated immune recognition. Like micronuclear DNA, mitochondrial DNA (mtDNA) is recognized by cGAS to drive STING-mediated inflammatory signaling. Mitochondrial damage can intersect DNA repair and inflammatory cascades with programmed cell death, through p53. In human fibroblasts and conditionally immortalized vascular smooth muscle cells, p53 mediates CD54 (ICAM-1) overexpression in senescence.

Replicative senescence, an autophagy-dependent program and crisis are anti-proliferative barriers that human cells must evade to gain immortality. Telomere-to-mitochondria signaling by ZBP1 mediates replicative crisis. Dysfunctional telomeres activate innate immune responses (IFN) through mitochondrial TR RNA (TERRA)–ZBP1 complexes. Senescence occurs when shortened telomeres elicit a p53- and RB-dependent DNA-damage response. A crisis-associated isoform of ZBP1 (an innate immune sensor) is induced by the cGAS–STING DNA-sensing pathway, but reaches full activation only when associated with TERRA transcripts from dysfunctional telomeres. p53 utilizes the cGAS/STING innate immune system pathway for both cell-intrinsic and cell-extrinsic tumor suppressor activities. cGAS-STING activation induces the production of IFN-β and increases CD54 expression in human cerebral microvascular endothelial cells.

In melanoma patients there is a significant correlation between cGAS expression levels and survival and between NK cell receptor expression levels and survival. Loss of cGAS expression by tumor cells could permit the tumor cell to circumvent senescence or prevent expression of immunostimulatory NKG2D ligands. Loss of p53 and gain of oncogenic RAS exacerbated pro-malignant paracrine signaling activities of senescence-associated secretory phenotypes. Results imply that heterogeneity in cGAS activity across tumors could be an important predictor of cancer prognosis and response to treatment, and suggest that NK cells could play an important role in mediating anti-tumor effects. Coculture of wild-type p53-induced human tumor cells with primary human NK cells enhanced NKG2D-dependent degranulation and IFN-γ production by NK cells.

When p53 consensus sequences are modified and DNA damage response is compromised, replicative crisis ensues, mitochondrial membranes malfunction, mtDNA expression is downregulated and IFN signaling is upregulated. A cell may then express activating immune ligands that bind NK receptors, signaling non-self and cytolytic death, or inhibitory receptors that signal self and immortality.

Sunday, January 16, 2022

Evidence of Purposeful Evolution

Darwin's evolution challenged!

A recently published article in Nature challenged evolution theory, suggesting that DNA repair is a more likely driver of evolutionary development than the environmental conditions thought to drive natural selection. In some sense the two may be linked, but this study showed how epigenome-associated mutation bias reduced the occurrence of deleterious mutations, challenging the prevailing paradigm that mutation is a directionless force in evolution.

Quantitative assessment of DNA gain and loss through DNA double-strand break (DSB) repair processes suggests deletion-biased DSB repair causes ongoing genome shrinking in A. thaliana, whereas genome size in barley remained nearly constant.

Introduction of as little as 0.7% sequence divergence between Alu elements resulted in a significant reduction in recombination, which indicates that even small degrees of sequence divergence reduce the efficiency of homology-directed DSB repair. Alu elements are the most abundant transposable elements (capable of shifting their positions), with over one million copies dispersed throughout the human genome.

The emergence of recombination-activating genes (RAGs) in jawed vertebrates endowed adaptive immune cells with the ability to assemble a diverse set of antigen receptor genes. Innate Natural Killer (NK) cells are unable to express RAGs or RAG endonuclease activity during ontogeny. They exhibit a cell-intrinsic hyperresponsiveness, but a diminished capacity to survive following virus-driven proliferation, a reduced expression of DNA damage response mediators, and defects in the repair of DNA breaks. However, RAG expression in uncommitted hematopoietic progenitors and NK cell precursors marks functionally distinct subsets of NK cells in the periphery, demonstrating a novel role for RAG in the functional specialization of the NK cell lineage.

The most active region of Human Chromosome 19 has a long history of recombinations that define the expression patterns of telomeric and centromeric proportions of Killer-cell immunoglobulin-like receptor (KIR) genes encoding receptors. KIRs bind cells presenting MHC class I HLA haplotype combinations that vary significantly across tissues in different population groups. Further, the deletion rate in Zinc Finger clusters (ZNF) located around 19q13.42, near KIR and C19MC between 51,012,739 and 55,620,741, is about twofold higher than the background deletion rate.

The relationship between deletions and mutation may indeed play a direct role in rapidly evolving, innate immunity. This may just begin to explain the speed at which global populations can respond and survive pandemics caused by the likes of COVID-19. And, the '19' in its nomenclature may go beyond time to the very chromosome responsible for innate immune diversity.

Tuesday, October 19, 2021

Blood Pressure, Immunity and p53 Checkpoint.

Background

A few chromosome 19 curiosities developed into a deep-dive after looking into the primordial immune complex, the origins of MHC Class I and antigen receptors as revealed by comparative genomics. And the plot thickened because repressors (of endogenous retroviruses) that gained their binding affinity to retrovirus sequences at the same time their targets invaded the human lineage are preferentially located on chromosome 19. Further, the deletion rate in Zinc Finger clusters (ZNF) located around 19p12 and 19q13.42, particularly between 51,012,739 and 55,620,741, is about twofold higher than the background deletion rate. A lot is going on at this very active location, which motivated this article.

At 19q13.42, kallikrein-related peptidases (KLKs), leukocyte immunoglobulin-like receptors (LILRs), including killer-cell immunoglobulin-like receptors (KIRs), as well as MYADM, an important blood pressure related gene, may also provide some clues to immunity variables that originate from or are influenced by this volatile region.

The retrotransposon bombardment of 19q13.42 and its doubled background deletion rate are a significant remnant. However, after MHC changed chromosomes during evolution, the ZNF clusters and, within their range, the chromosome 19 miRNA cluster (C19MC, between 53,671,968 and 54,264,387) were still subjected to the deleterious effect of transposons. Regardless, suppression mechanics have kept epigenetic, regulatory and transcription processes, across genes far and wide, on the move at relatively stable rates. For example, reverse-transcribed SARS-CoV-2 RNA can integrate into the genome of cultured human cells and can be expressed in patient-derived tissues, but the effects of suppression may be sufficient to elicit a more permanent natural defense. In any event, insertions and DNA damage are closely related and associated with loss of p53 that results in centrosome amplification.
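As a rough illustration of the coordinates quoted above, the following sketch checks that the C19MC interval falls inside the wider ZNF deletion-prone interval and estimates what share of that range it spans. The `Interval` class and its method names are hypothetical helpers introduced only for this example; the coordinates are the ones given in this post and are not tied here to a specific genome assembly.

```python
from typing import NamedTuple


class Interval(NamedTuple):
    """A named span on chromosome 19 (hypothetical helper for illustration)."""
    name: str
    start: int
    end: int

    def contains(self, other: "Interval") -> bool:
        # True when `other` lies entirely within this interval.
        return self.start <= other.start and other.end <= self.end

    def span(self) -> int:
        # Distance between the two quoted coordinates, in base pairs.
        return self.end - self.start


# Coordinates as quoted in the post.
ZNF_RANGE = Interval("ZNF clusters", 51_012_739, 55_620_741)
C19MC = Interval("C19MC", 53_671_968, 54_264_387)

c19mc_inside_znf = ZNF_RANGE.contains(C19MC)
c19mc_share = C19MC.span() / ZNF_RANGE.span()

print(f"C19MC inside ZNF range: {c19mc_inside_znf}")
print(f"C19MC spans {C19MC.span():,} bp, {c19mc_share:.1%} of the ZNF range")
```

The same containment test could be applied to any other locus quoted with a coordinate, provided all positions come from the same assembly.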

As cells pass through epithelial to mesenchymal transition (EMT), DNA damage prevents the normal reduction of p53 levels diverting the transcriptional program toward mesoderm without induction of an apoptotic response. In contrast, TP53-deficient cells differentiate to endoderm with high efficiency after DNA damage, suggesting that p53 enforces a “differentiation checkpoint” in early endoderm differentiation that alters cell fate in response to DNA damage.

Reproduction, Blood Pressure and NK

In reproduction, some of the 59 known miRNAs from primate-specific C19MC are highly expressed in human placentas and in the serum of pregnant women. They are also packaged into extracellular vesicles of diverse sizes, including exosomes, and endow non-trophoblast cells with resistance to a variety of viruses. At least miR-517a-3p (a C19MC miRNA from the fetal placenta) was incorporated into maternal NK cells in the third trimester, and it was rapidly cleared after delivery. miRNAs regulate the migration of human trophoblasts and suppress EMT genes critical for maintaining the epithelial cytotrophoblast stem cell phenotype.

Maternal uterine or decidual Natural Killer cells (dNK) express AT1, AT2 and ANP, proteins of the Renin Angiotensin System (RAS), suggesting dNK have the potential to contribute to changes in blood pressure that occur between days 5 and 12 of pregnancy in mice. And pressure-related mechanical stretch on endothelial cells interconnects innate and adaptive immune responses in hypertension.

Pressure variables in cells and tissues may result from infection, inflammation and membrane stretch, including stretch of the inner mitochondrial membrane, which affects the electron transport chain, endoplasmic reticulum, antigen production and presentation, and exosome-bound p53/miRNA release. ANP colocalization to dNKs at day 12 suggests a localized RAS-related responsiveness. STAT3 in monocytes was activated by increased endothelial stretch and is involved in driving almost all of the pathways that control NK cytolytic activity as well as the reciprocal regulatory interactions between NK cells and other components of the immune system. The crosstalk between STAT3 and p53/RAS signaling controls cancer cell metastasis and cisplatin resistance via the Slug/MAPK/PI3K/AKT-mediated regulation of EMT and autophagy.

Educating NK Subsets

Looking into some of the ~15 genes scattered among C19MC (~sixty miRNAs) between 53,671,968 and 54,264,387:

1. MYADM was one of two blood pressure signature genes (a copper uptake protein being the other) differentially expressed for systolic and diastolic blood pressure and hypertension. Of the ~35 identified genes, several more are strongly related to immune cell functions, including PRF1, GNLY, TAGAP, IL2RB and GZMB, as well as CD97, NKG7 and CLC, which are located on chromosome 19. The endothelium maintains a barrier between blood and tissue that becomes more permeable during inflammation. MYADM controls endothelial barrier function through ezrin, radixin, and moesin dependent regulation of the expression of ICAM-1, an essential receptor for NK interaction.

2. PRPF31 is recruited to introns following the attachment of U4 and U6 (spliceosome) RNAs. Experiments using PRPF31 determined that p53 activation is a general consequence of interfering with the spliceosome.

3. At 54,617,158, the LILRB1 receptor is expressed on immune cells where it binds to MHC class I molecules on antigen-presenting cells and transduces a negative signal that inhibits stimulation of an immune response. LILRB1 has a polymorphic regulatory region that enhances transcription in NK cells and recruits zinc finger protein YY1, which inhibits p53. It also educates expanded human NK cells and defines a unique antitumor NK cell subset with potent antibody-dependent cellular cytotoxicity.

Monocyte/macrophage immunoglobulin-like receptor (MIR) genes are closely linked to the KIR gene family and the gene for FcαR at 19q13.4. The linkage was discovered in 1997 when a mouse sequence related to MIR mapped to a region on chromosome 7 syntenic with human 19q13.4. In 2012 a cluster of genetic loci, from multiple mouse strains and across anatomical sites, was found to jointly contribute to the development of both thymic and splenic invariant natural killer T (NKT) cell levels. The dominant cluster was on mouse chromosome 7 and included almost all the non-C19MC genes located within the human C19MC region: MYADM, CACNG7, VSTM1, TARM1, PRKCC(G), TFPT, NDUFA3, CNOT3, LENG1, TSEN34, RPS9.

Four of nineteen knockout genes that enhanced NK cell function were on chromosome 19, including GSK3, which phosphorylates Mdm2 to regulate p53 abundance; this would contribute to NK enhancement.

A study of MHC disassortative mating in humans found Israelis were more gene-similar but MHC-dissimilar, whereas Europeans were gene-dissimilar and MHC-dissimilar. Now, a recent study found remarkably low KIR and HLA diversity in Amerindians, revealing signatures of strong purifying selection shaping the centromeric KIR region. This points to the importance of the LILR-KIR region on chromosome 19, which codes for the strongest NK cell educator receptors.

p53 regulates exosomes, and miRNAs directly influence NK responsiveness, including regulation of dNK during pregnancy. Exosomes regulated by p53 also transfer it and can suppress growth and proliferation of p53-negative cells. Further, miRNAs induced by p53 can directly target ULBP2 mRNA and reduce its cell-surface expression.

Disease highlights

The rs78378222 polymorphism in the 3'-untranslated region of TP53 contributes to development of age-associated cataracts by modifying miRNA-125b-induced apoptosis of lens epithelial cells. miRNA-125b is a novel negative regulator of p53. Deleting PRPF31 activates the p53 pathway and triggers apoptosis of retinal progenitor cells. The members of the miR-125 family (miR-125a on chromosome 19q13.4 and miR-125b on chromosome 21q21.1) reside in two distinct human miRNA clusters with the let-7 and miR-99 families, and these miRNAs are thus likely co-transcribed.

More succinctly, NK cells are alerted to induction of p53 in cancer cells by upregulation of the NKG2D ligands ULBP1 and ULBP2. p53 also induces expression of miR-34a and miR-34c, which target ULBP2 mRNA for destabilization. These observations suggest two possibly contrasting roles for p53 in NKG2DL expression and require more investigation into how the regulation is fine-tuned. Extending this model to human populations would suggest that p53 must be inactivated among those with a robust NK response (those with B haplotypes).

Taken together, our data suggest functional interactions between KIR and HLA modify risks of basal cell carcinoma (BCC) and squamous cell carcinoma, and that KIR encoded by the B genes provide selective pressure for altered p53 in BCC tumors.

Conclusion

The convergence of several important cellular mechanisms that point back to a 19q13.42 address may illustrate ancient and conserved elements that perpetuate and function as integrated biological units affecting blood pressure, reproduction and immunity. Many of these impart education to innate immunity.

Sunday, June 20, 2021

First Intron DNA - Site for a Genetic Brain?

DNA Methylation

The first intron of a gene, regardless of tissue or species, is conserved as a site of downstream methylation with an inverse relationship to transcription and gene expression. Therefore, it is an informative gene feature regarding the relationship between DNA methylation and gene expression. But expression in induced pluripotent stem cells (iPSCs) has been a major challenge to the stem cell industry because, by comparison, these cells have not yet reached the state of natural pluripotent or embryonic stem cells (ESCs).

In mice, two X chromosomes (XC) are active in the epiblasts of blastocysts as well as in pluripotent stem cells. One XC is inactivated when Xist (non-coding) RNA transcripts coat and silence it. Designer transcription factor (dTF) repressors binding the Xist intron 1 enhancer region caused higher H3K9me3 methylation and led to XC opening and X-linked gene repression in MEFs. This substantially improved iPSC production and somatic cell nuclear transfer (SCNT) preimplantation embryonic development. This also correlated with far fewer of the abnormally expressed genes frequently associated with SCNT, even though it did not affect Xist expression. In stark contrast, the dTF activator targeting the same enhancer region drastically decreased both iPSC generation and SCNT efficiencies and induced ESC differentiation.

A genome-wide, tissue-independent quasi-linear, inverse relationship exists between DNA methylation of the first intron and gene expression. More tissue-specific, differentially methylated regions exist in the first intron than in any other gene feature. These have positive or negative correlation with gene expression, indicative of distinct mechanisms of tissue-specific regulation. CpGs in transcription factor binding motifs are enriched in the first intron and methylation tends to increase with distance from the first exon–first intron boundary, with a concomitant decrease in gene expression.

Since the relationship between sequence, methylation, repression and transcription is determinative in ESC differentiation, it may also suggest a broader link to differential translation. Translation is required for miRNA-dependent transcript destabilization that alters levels of coding and noncoding transcripts. But steady-state abundance and decay rates of cytosolic long non-coding RNAs (lncRNAs) are insensitive to miRNA loss. Instead, lncRNAs fused to protein-coding reporter sequences become susceptible to miRNA-mediated decay.

In this model, first intron DNA sequences that are differentially methylated bind transcription factors that affect transcription and impact splicing, expression of coding or non-coding transcripts and transcript destabilization, resulting in differential rates and possible variations in translation. This bottom-up, dynamic view of the classical process may elevate the first intron from 'junk' to a DNA 'brain' because it plays a more extensive role, heading the process toward translation of any gene or switching it off entirely.

For this reason, among others, Codondex uses first intron k-mers relative to the transcript's mRNA as the basis for comparing same-gene transcripts in diseased cells or tissue samples. Further, p53 and BRCA1 miRNA key sequences, discovered using the Codondex iScore algorithm, significantly reduced proliferation when transfected into HeLa cells, which may result from this accelerated, transfected miRNA-dependent decay.

Thursday, May 13, 2021

Non-Coding DNA Key Sequences

DNA Structural Inherency

Wind two strands of elastic: eventually it will knot, and ultimately it will double up on itself. Separate the strands. From the point of unwinding, forces will be directed to different regions and the separation will approximately return to the wound state of the band. Do the same with each of 10 different bands or strings of any type, and they will all behave in much the same way. For a given section of DNA being transcribed, the effect of separation will be much the same. For a given gene, there will be sequences that can tolerate force to greater or lesser degrees. For different transcripts of a gene, variation at those sequences may be crucial to the integrity of the transcription machinery that separates DNA strands to initiate copying into RNA, and to the outcome.

Cellular biology is enormously complex in all regards. The physics of molecular interaction, fluid dynamics, and chemistry combine in a system where cause and effect is near impossible to predict. At the most elementary level we hypothesize that some non-coding DNA (ncDNA) possesses structural inherencies that can be deployed to direct gene proteins and cell function for diagnosis or therapy.

Coding DNA and its regulatory, non-coding gene complement are transcribed and spliced from a transcribed gene. Transcription to RNA, edited mRNA, spliced non-coding RNA and ultimately mRNA translation to protein can produce wide-ranging, variable outcomes that may not be re-captured experimentally.

A single nucleotide polymorphism (SNP) or SNP combinations within a gene may affect the finely tuned balance that results. Under different environmental conditions this could be material to the protein produced. Additionally, other mutations of the gene could add complexity to the environment and/or the resulting protein translation.

At this level of cellular biology, genetic DNA stores instruction for protein assemblies to produce new protein required for the fully functional cell. However, DNA's stored mutations can lead to different functional or non-functional versions of protein depending on many different factors. Relationships between ncDNA, including mutations and the transcripts' edited, protein coding mRNA may represent unexplored inherencies that can regulate the gene's mRNA or translated protein.

We built an algorithm to compare, in detail, the ncDNA sequences of multiple protein-coding transcripts of the same gene. For each transcript it steps through every variable-length ncDNA sequence (kmer), specifically in intron 1, computes a signature for each and indexes it to the constant of the transcript's mRNA signature. At each step these signatures order the kmers of the transcripts. The order is represented in a vector of all the transcripts being compared.

At millions of successive steps (depending on total intron 1 lengths), transcripts mostly retain their vector ordering except, as expected, at a kmer length change. Mostly, transcript order in the vector does not change; occasionally a few positions change, and very rarely do all positions change. Position changes that cause another, like a domino effect, are filtered out. For the rarest position changes at a step, we look to the root causes in the kmer (sequence). We call this a Key Sequence because it is identified by the significance of changes to transcript positions in the vector compared to the vector at the next step.

Therefore, Key Sequences cause the most position changes between transcripts being compared by the algorithm. This relative measure is step dependent, and Key Sequences are discovered by comparing transcript positions in the vector at the next step location. Logically, this implies either a gene's structural inherency, discovered through ncDNA Key Sequence relationships to mRNA and to other transcripts, or an error in gene alignments, sequenced reads or the algorithm.
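The stepping and ordering procedure described above can be sketched roughly as follows. This is not the Codondex implementation: the `signature` function, the base weights, the function names and the tie-breaking rule are all assumptions made only so the example runs, and the domino-effect filter mentioned above is left out. The sketch only shows the shape of the idea: score each intron 1 kmer against its transcript's mRNA signature, order the transcripts at each step, and report the steps whose ordering differs most from the next step.

```python
from typing import Dict, Iterator, List, Tuple

WEIGHT = {"A": 1, "C": 2, "G": 3, "T": 4}  # hypothetical base weights


def signature(seq: str) -> float:
    """Toy position-weighted signature; a stand-in, not the real one."""
    return sum(WEIGHT[b] * (i + 1) for i, b in enumerate(seq)) / len(seq)


def orderings(
    transcripts: Dict[str, Tuple[str, str]], k_min: int, k_max: int
) -> Iterator[Tuple[int, int, Tuple[str, ...]]]:
    """Yield (k, offset, transcript order) for every kmer step.

    `transcripts` maps a transcript name to (intron 1 sequence, mRNA sequence).
    """
    shortest = min(len(intron) for intron, _ in transcripts.values())
    mrna_sig = {name: signature(mrna) for name, (_, mrna) in transcripts.items()}
    for k in range(k_min, min(k_max, shortest) + 1):
        for offset in range(shortest - k + 1):
            # Index each kmer signature to the transcript's constant mRNA signature.
            score = {
                name: signature(intron[offset:offset + k]) / mrna_sig[name]
                for name, (intron, _) in transcripts.items()
            }
            # Ties are broken by name so the ordering is deterministic.
            yield k, offset, tuple(sorted(score, key=lambda n: (score[n], n)))


def key_steps(
    transcripts: Dict[str, Tuple[str, str]], k_min: int, k_max: int
) -> List[Tuple[int, int, int]]:
    """Return (k, offset, positions moved) for the steps that change the most."""
    steps = list(orderings(transcripts, k_min, k_max))
    changes = []
    for (k, off, order), (k_next, _, order_next) in zip(steps, steps[1:]):
        if k != k_next:  # skip the expected reshuffle at a kmer length change
            continue
        moved = sum(a != b for a, b in zip(order, order_next))
        if moved:
            changes.append((moved, k, off))
    if not changes:
        return []
    most = max(m for m, _, _ in changes)
    return [(k, off, m) for m, k, off in changes if m == most]


# Made-up sequences, for illustration only.
demo = {
    "t1": ("ACGTACGTAC", "ATGGCC"),
    "t2": ("ACGTTTGTAC", "ATGGCA"),
    "t3": ("AAGTACGGAC", "ATGCCC"),
}
for k, offset, moved in key_steps(demo, 3, 4):
    print(f"k={k} offset={offset} positions moved={moved}")
```

A kmer at a reported (k, offset) step would then be the candidate to inspect as a Key Sequence under these assumptions.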

In assay testing we were able to predict and synthesize non-coding RNA Key Sequences that significantly reduced proliferation of HeLa cells. In our pre-clinical work, based on comparisons to transcripts of TP53, we will be predicting the efficacy of cell and tissue selections that educate and activate Natural Killer cells.

If Key Sequences are inherent they could open a new frontier for diagnosis and therapy.

Monday, March 8, 2021

Custom Immunotherapy To Address Dimorphic Complexities.

Dimorphic relationships between genes on Chromosome (Chr) 6, encoding Human Leukocyte Antigens (HLA), and those on Chr19, encoding Killer-cell immunoglobulin-like receptors (KIRs), may eventually uncover important information as to how, why and when Natural Killer (NK) cells determine self-restraint or attack cells infected by pathogens and disease. These proteins emerge from their respective zones on each chromosome, which have been and continue to be subject to frequent recombination events.

The active region of Chr19 has a long history of recombinations that have defined and continue to define the expression patterns of telomeric and centromeric proportions of KIR genes encoding receptors that bind cells presenting MHC class I HLA haplotype combinations, which vary significantly across tissues in different population groups. Adding complexity, HLA genes on Chr6 are also subject to significant recombination, making the dimorphic functional HLA-KIR interactions difficult to predict.

Studies across population groups reveal the great diversity of HLA-KIR dimorphisms. The Southern Han centromeric KIR region encodes strong, conserved, inhibitory HLA-C-specific receptors, and the telomeric region provides a high number and diversity of inhibitory HLA-A and -B-specific receptors. In all these characteristics, the Chinese Southern Han represent other East Asians, whose NK cell repertoires are thus enhanced in quantity, diversity, and effector strength, likely augmenting resistance to endemic viral infections.

One study goes much further, suggesting that functional interactions between KIR and HLA modify risks of basal cell carcinoma (BCC) and squamous cell carcinoma (SCC) and that KIR B haplotypes provide selective pressure for altered p53 in BCC tumors. This preference implicates multi-modal p53 mechanisms that are also known to upregulate NK ligands, induce HLA-A11 assembly against Epstein-Barr virus and bind a frequently mutated p53 peptide in a complex with HLA-A that is presented at the cell surface and prevents T-cell response. In support, selected p53 mutations altering protein stability can modulate p53 presentation to T cells, leading to a differential immune reactivity inversely correlated with measured p53 protein levels.

In addition to KIR, adaptive NKG2C+ NK cells display fine peptide specificity, selectively recognizing HCMV strains that differ by a single substitution in the HLA-E-binding UL40-derived peptide during infection. Distinct peptides controlled the degree of proliferation in synergy with pro-inflammatory cytokines. Viral peptides are known to augment inhibition at NKG2A. Conversely, NKG2A+ NK cells sense MHC class I downregulation more efficiently than KIRs. Thus, both receptor:ligand systems appear to have complementary functions in recognizing changes in MHC class I.

Polymorphic landscapes across HLA, KIR and NKG receptor repertoires, coupled with receptor:ligand haplotype cross-referencing, make it near impossible to predict therapeutic targets across the breadth of disease and disease combinations that affect populations. A recent KIR-HLA co-existence study of haplotypes in breast cancer patients and controls highlights this complexity.

Genetic signatures that target discovery of desired cell functionality to select preferential cells/tissues from micro environments used to educate and license autologous or allogeneic NK cells may tease specific, finely tuned, intact receptor repertoires. Once licensing efficacy is reached, expanding NK cell populations and applying them to act upon previously unrecognizable cells of a patient becomes the next frontier of immune therapy. This is the exciting work presently being undertaken by researchers and staff working with Precision Autology using Codondex methodologies.

Saturday, February 13, 2021

Cells with an Index like Google?

It's been a while since I last wrote about DNA repeats or their RNA descendants. In that time, advanced research has emerged relating repeats to increasing numbers of viral or other diseases. Generally, the repeats of interest here can be either long or short sequences of nucleotides that form part of an unspliced gene. Logically, counts of long sequences that repeat would be lower than counts of short sequences, but when normalized to their respective nucleotide lengths the indexed results can shift the relative order of repeating sequences quite dramatically.
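A small sketch can make the effect of length normalization concrete. The post does not say which normalization is used, so the example assumes one possible reading: weight each repeat by the nucleotides it covers (count multiplied by length). The function names and the toy sequence are made up for illustration.

```python
from collections import Counter
from typing import Dict, Iterable, List, Tuple


def repeat_counts(seq: str, k: int) -> Dict[str, int]:
    """Count overlapping kmers of length k that occur more than once."""
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    return {kmer: n for kmer, n in counts.items() if n > 1}


def rank_repeats(seq: str, k_values: Iterable[int]) -> List[Tuple[str, int, int]]:
    """Return (kmer, count, count * length), highest coverage first.

    Weighting by count * length is an assumed normalization, not one
    taken from the post.
    """
    rows = [
        (kmer, n, n * k)
        for k in k_values
        for kmer, n in repeat_counts(seq, k).items()
    ]
    return sorted(rows, key=lambda row: (-row[2], row[0]))


toy = "ACGTACGTAAAA"  # made-up sequence
ranked = rank_repeats(toy, (2, 4))
top_by_count = max(ranked, key=lambda row: row[1])[0]
top_by_coverage = ranked[0][0]

print("most frequent repeat:", top_by_count)
print("top repeat after length weighting:", top_by_coverage)
```

In this toy sequence the short repeat occurs most often, but a longer repeat moves to the top once length is taken into account, which is the kind of reordering described above.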

In most knowledge systems, repeats in low-level data present redundancy and an opportunity to improve efficacy in local or global upstream processes acting on that data. We see this in the structure of efficient alphabets, which had a significant impact on whether or not a language survived continuous use. Why use ten words when precise meaning, including abstracts, can be derived from three? Or why alpha when, at least for some period in the language's history, alphanumeric made it more effective?

Search engines reduce their primary index to the least redundant data set used to drive efficient data access by upstream requests and processes to satisfy any query. However, at the storage level, data redundancy is permitted because energy efficiency is gained. Similarly genetic DNA is massively redundant. Redundant data stores can make highly indexed systems more efficient because frequently accessed data elements are more accessible at multiple locations and parallel processes can more efficiently satisfy upstream requests.

Repetitive sequences constitute 50%–70% of the human genome. Some of these can transpose positions; these transposable elements (TEs) are DNA transposons and retrotransposons. The latter are predominant in most mammals and can be further divided into long terminal repeat (LTR)-containing endogenous retrovirus transposons and non-LTR transposons including short interspersed nuclear elements (SINEs) and long interspersed nuclear elements (LINEs). The most abundant subclass of SINEs comprises primate-specific Alu elements, which in humans are more abundant in GC-rich DNA. Humans have up to 1.4 million copies of these repeats, which constitute about 10.6% of the genomic DNA. Long interspersed element-1 (LINE1 or L1) sequences are abundant in AT-rich DNA, constitute 19% of the human genome and make up the largest proportion of transposable element-derived sequences.

Most TE classes are primarily involved in reduced gene expression, but Alu elements are associated with upregulated gene expression. Intronic Alu elements are capable of generating alternative splice variants in protein-coding genes, which illustrates how Alu elements can alter protein function or gene expression levels. Non-coding regions were found to have a great density of TEs within regulatory sequences, most notably in repressors. This global impact on gene regulation indicates a significant association between repetitive elements and gene regulation.

In liquid systems, phase separation is one of the most fundamental phase transition phenomena and ubiquitous in nature. De-mixing of oil and water in salad dressing is a typical example. The discovery of biological phase separation in living cells led to the identification that phase-separation dynamics are controlled by mechanical relaxation of the network-forming dense phase, where the limiting process is permeation flow of the solvent for colloidal suspensions and heat transport for pure fluids. The application of this derived governing universal law is a step to understanding and defining the liquid biological indexing equivalence of data-processing systems and inherent genetic redundancy.

Repeats have been widely implicated. In plant immunity a TE has been domesticated through histone marks and generation of alternative mRNA isoforms that were both directly linked to immune response to a particular pathogen. p53 transcription sites evolved through epigenetic methylation, deamination and histone regulation that constituted a universal mechanism found to generate various transcription-factor binding sites in short TEs or Alu repeats. In disease, cytoplasmic synthesis of Alu cDNA was implicated in age-related macular degeneration, and there is a transient increase of nearly 20-fold in the levels of Alu RNA during stress, viral infection and cancer.

In chromosomal DNA, each sequence, relative to its length, may conveniently describe a phase-separated indexed location and method for discovery. Repeats within genetic DNA may present precisely sensitive phase-separated guidance to drive histone, epigenetic and transcription factors to specific genetic locations at the cell's 'end-of-line', from where the genetic response to upstream membrane-bound changes begins.

Thursday, September 24, 2020

$100,000 Biohunt

Some of the past research on neoantigens and p53 antibodies in immunity has been encouraging. The data are enormously complex, but keep pointing to TP53's great potential. To this end, we were anxious to start our mega-experiment but were delayed by COVID-19; now I'm glad to report we are well underway. In co-operation with researchers at UCLA we aim to determine whether Codondex transcript analysis of TP53 can predict the best tumor tissue selection for the most effective Natural Killer (NK) cell priming, activation and cell killing, including in autologous tumor microenvironments.

We're hoping to achieve a result along the path toward our ambitious clinical goal. We aim to prove that a specifically selected section of biopsied tissue can be used to effectively prime autologous NK cells for patient reapplication and disease treatment.

This co-culture vs. sequencing challenge uses sections (T1-T8) taken from each of two tumors. Each section is co-cultured with two treated NK cell lines and one naive NK cell line to test the efficacy of NK cell cytotoxicity against tumor cells and tumor tissue in killing assays. Separately, by sequencing TP53 of each selection and applying the Codondex iScore(TM) algorithm we hope to identify specific features of each tissue selection that link computed results to research outcomes.

Co-culture vs. Sequencing Challenge

To better understand the analysis and encourage research contributions we are inviting applicants for first grants directed toward this objective.

Codondex tools analyse genetic sequences at an arbitrary number of nucleotides, providing an easy way to observe the fine repetitive details of small subsequences contained within a gene. We compute various metrics for each subsequence, including 'Inclusiveness', which measures the total number of times every computed smaller subsequence is found within the subsequence of interest.
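As a rough illustration of what an Inclusiveness-style count could look like, here is a minimal sketch. It is an assumption-laden reading of the description above, not the Codondex implementation: the function names are hypothetical, the catalog of "computed smaller subsequences" is simply passed in, and overlapping matches are counted.

```python
def count_overlapping(haystack, needle):
    """Count occurrences of needle in haystack, allowing overlaps."""
    return sum(
        haystack.startswith(needle, i)
        for i in range(len(haystack) - len(needle) + 1)
    )

def inclusiveness(subsequence, catalog):
    """Total occurrences, inside `subsequence`, of every strictly shorter catalog entry."""
    return sum(
        count_overlapping(subsequence, smaller)
        for smaller in set(catalog)
        if 0 < len(smaller) < len(subsequence)
    )

# Hypothetical catalog of previously computed subsequences (toy data).
catalog = ["AC", "CG", "ACG", "ACGT"]
print(inclusiveness("ACGACGT", catalog))
```

Passing the catalog in as an argument keeps the sketch neutral about which smaller subsequences the real tool computes.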

Our primary interest is intronic, non-coding DNA in multi-transcript genes. In these systems we create a transcript list, which we call the Vector, that is sorted by Codondex i-Score. This metric takes Inclusiveness scaled by the length of the subsequence, to better account for the intrinsic probability of finding smaller subsequences within progressively longer ones. Using this, we look at the way the order of this vector changes from subsequence to subsequence. A large change in the vector prompts us to tag the subsequence for further investigation, as it represents a large deviation from transcript similarity; such a subsequence is labelled a Key Sequence.
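The vector comparison can be sketched in a few lines. This is only an illustration under stated assumptions: the scores are made-up numbers standing in for i-Scores, the sort direction (highest first) is assumed, and "large change" is reduced to a hypothetical `min_moved` threshold on how many transcripts change position.

```python
def vector(scores):
    """Transcripts sorted by descending score; ties broken by transcript name."""
    return sorted(scores, key=lambda t: (-scores[t], t))

def positions_moved(prev, curr):
    """Number of transcripts whose position differs between two vectors."""
    return sum(1 for i, t in enumerate(curr) if prev.index(t) != i)

def key_sequences(scores_by_subsequence, min_moved):
    """Return subsequences whose vector differs from the previous one
    in at least `min_moved` positions."""
    flagged = []
    prev = None
    for subseq, scores in scores_by_subsequence.items():
        curr = vector(scores)
        if prev is not None and positions_moved(prev, curr) >= min_moved:
            flagged.append(subseq)
        prev = curr
    return flagged

# Hypothetical scores for three transcripts at three consecutive subsequences.
scores_by_subsequence = {
    "ACG":   {"T1": 0.9, "T2": 0.5, "T3": 0.1},
    "ACGT":  {"T1": 0.8, "T2": 0.6, "T3": 0.2},
    "ACGTA": {"T1": 0.1, "T2": 0.4, "T3": 0.7},
}
print(key_sequences(scores_by_subsequence, min_moved=2))
```

Counting moved positions is the simplest possible change measure; a distance-weighted measure would be a natural refinement if how far each transcript moves also matters.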

Codondex is proposing 3 grants for open problems to aid in our journey towards a more biologically useful platform. These 3 problems span statistical analysis, data acquisition and biological relevance of various aspects that are integral to our platform.

Applicants should inquire further and sign up here.

Monday, June 8, 2020

Oil and Water and Cellular Function

Genetic DNA consists of single nucleic acid nucleotides strung along a sugar-phosphate spine that winds around proteins called histones and collapses into a chromosome assembly. At specific 'gene' locations DNA is often unwound and transcribed into smaller, related RNA strings that can be incorporated by clustered proteins to attract and assemble amino acid combinations that may fold into functional proteins. Aqueous proteins aggregate in complex units and interact with DNA, RNA, amino acids and other proteins to build life on planet Earth.

Entropy can disrupt the order of liquid-liquid phase separation (LLP) and other density-based separations that govern events affecting DNA and are central to cellular biophysics. Since the discovery of DNA in 1869 and its double helix structure in 1953, research has been directed at deciphering the vast string assemblies of billions of these ordered acid combinations that govern cells of different species. Recent research has more beautifully described how orders of short repeating DNA sequences govern cellular mechanics and provide insight into the delicate balance in aqueous separations.

Chromosomes of cells that divide and replicate are tethered via the centromere, which includes concentrated short, ordered DNA combinations repeated at extending distances along the sugar-phosphate spine. They attract proteins and other epigenetic factors that may direct the cell's centrosome - a protein tube geared to a vast cytoskeleton spindle to move chromosomes and the cell's skeletal structure in response to activity on its centromere and distant regions.

Intron regions of genes are considered regulatory since exons, the DNA coding regions, when transcribed into RNA exclusively translate into combinations of amino acids for protein. The intron regions of yeast centromeres were found to promote formation of centromeric heterochromatin - DNA wound around histones and methylated to repress regions and maintain lineage during replication.

A study of centromere heterochromatin surprisingly showed that distant euchromatic regions enriched in repressed methylated genes also interacted with the hierarchical organization of centromeric DNA. These 3D spatial interactions are likely mediated by LLP (similar to how oil and vinegar separate in salad dressing), resulting in liquid-like fusion events, and can influence the fitness of individuals. Repressed genes were identified as Transposable Elements (TEs), sequences often associated with pathogenic DNA insertions that have been persistently retained.

A study found 96.3% of TEs enriched in 156 gene bodies overlapped introns, in line with the normally observed distribution of introns and exons in the human genome. Across cells in different tissues, genes that are consistently replicated are less likely to be associated with TE's. Multiple TE's in tissue-specific, active regulatory regions are enriched in intron enhancer sequences to attract and bind protein transcription factors as master replication regulators.

TE's have mostly been analyzed by the frequency of short identical repeating sequences, but these methods have not revealed the full extent of the TE repeat hierarchy. When any part of a TE is replicated and released from its sugar-phosphate spine the hierarchy of repeats may affect dissociation. Codondex built a uniform analytic to tease out the inherent hierarchy of repeating sequences that may expose separation potential whether or not the DNA is classified as a TE.

As outlined, repressed DNA regions with more frequent repeats are less actively transcribed into RNA. Actively transcribed regions therefore yield more RNA for coding proteins, and edited intron RNA can accumulate and concentrate in the liquid nucleus, be transported to the cytoplasm or be degraded. A cell's machinery must be finely tuned to process the RNA remnant of DNA transcription, but mutations and aberrant separations can disrupt the order of these finely tuned micro-organisms.

If repeats define a universal separation hierarchy that is heavily weighted toward regulatory introns, then de novo chromosome and gene repeat analysis may identify distant and centromeric influences on the centrosome. The iScore(TM) algorithm repeatedly explodes any DNA or RNA string into its ordered, theoretical hierarchy of repeats until the smallest required string length is reached, and may provide a structural basis for liquid separations. A repeat hierarchy for any gene would also have to relate to its chromosome repeats for inherent, universal influence over 3D spatial interaction and potentially cell function.

The complete record of repeats for an average length gene explodes to 100,000,000+ ordered strings representing its iScore signature. If a repeat hierarchy does exist for aqueous aggregations, a gene transcript's intron iScore should be sufficient to measure and compare its inherent repeat potential to other transcripts. Significant consecutive iScore variations within any of the 100,000,000+ strings could be used to expose systemic, structural separation differences for that transcript in the context of other transcripts in their aqueous environments.
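To get a feel for that scale, here is a small sketch of what "exploding" a string could mean and how quickly the number of strings grows. It assumes, purely for illustration, that the exploded record is every contiguous substring down to a minimum length; the function names are hypothetical and this is not the iScore(TM) algorithm itself.

```python
def substring_count(n, min_len=1):
    """Number of contiguous substrings of length >= min_len in a string of length n."""
    m = n - min_len + 1
    return m * (m + 1) // 2 if m > 0 else 0

def explode(seq, min_len=1):
    """Yield every contiguous substring, longest first, left to right."""
    for k in range(len(seq), min_len - 1, -1):
        for i in range(len(seq) - k + 1):
            yield seq[i:i + k]

print(list(explode("ACG")))
print(substring_count(14_142))
```

Under this assumption the count grows as n(n+1)/2, so a sequence of roughly 14,000 nucleotides already passes 100 million substrings.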

Thursday, December 19, 2019

Therapeutic Coding and non-Coding DNA Relationships

Relationships between coding and non-coding intra-gene DNA are good cause for intense research and scientific debate. Many cellular functions of non-coding DNA have been discovered in the past 30 years, but prior to that these genomic regions were mostly considered 'junk'.

Probing relationships between a gene's protein-coding cDNA and at least one non-coding DNA section of the transcript, which in our work is intron1, can yield important data about genomic features in the combination. Over the past 7 years we focused on interrogating combination relationships across multiple transcripts to construct intra-gene DNA signatures from apparently disparate DNA elements that are known to perform vastly different biological functions, yet are proximal and often adjacent.

First we considered whether codon to amino acid coding may operate a little differently from the classical view if reading a first and second nucleotide made the third deterministic. This method would not alter the outcome of known protein coding, but it may alter the way we consider combination relationships between nucleotides. For a transcript, any given length of cDNA and its respective intron1 sequence could possess undiscovered intrinsic order. In a model where order was tightly honored, transcript relativity may identify cDNA sequences that caused significant change in the order at each next nucleotide step.

To investigate transcripts, we computed cDNA k-mers of every length, starting from the first nucleotide. We associated k-mers of every possible length with the cDNA transcript's intron1 signature. Then, for a set of multiple transcripts of the same gene, our algorithm stepped through in nucleotide order and ordered the transcripts into a vector based on their respective cDNA-kmer:intron1-signatures. Stepping from one k-mer to the next, we observed whether the next k-mer significantly changed the order of transcripts in the vector. After filtering domino effects we ranked the k-mers whose transcript order changed most significantly from the previous k-mer.

Size of circle 'K' in the example indicates k-mer length, but we only compare same length K

In the above example, it is evident that k-mer2 vs k-mer3 was the most changed because all three transcript positions moved without a domino effect. From the vector we identify intra:inter transcript conditions in next nucleotide relationships as represented in the k-mers.

As an example, in our work with 15 viable consensus transcripts for p53, occasionally all 15 transcripts in the vector changed positions at the next k-mer. These intra-transcript k-mer relationships govern the transcripts' order in the vector, but when the order is relaxed at the next k-mer and positions move, particularly where the significant majority of positions move, it indicates that the intra-transcript k-mer condition is relative to other transcript k-mers in the vector. The more transcripts move, and the further they move, the more relevant their intra-transcript k-mer relationships are likely to be to the gene.

This transcript comparison presents a new method for diagnosis and therapy because each new transcript, when compared to the consensus set, has the capacity to disrupt order in the vector and yield k-mers that are specifically relevant to the gene. In our assay testing we were able to predict and synthesize ncRNA sequences that significantly reduced proliferation of HeLa cells. In our pre-clinical work, based on comparisons to transcripts of the TP53 consensus, we will be predicting the efficacy of cell and tissue selections that educate and activate Natural Killer cells.

Pre-clinical flow chart to educate NK cells with tumor tissue/cell co-cultures and prove prediction

Codondex