The first comprehensive overview of preprocessing, mining,
and postprocessing of biological data

Molecular biology is undergoing exponential growth in both the
volume and complexity of biological data--and knowledge
discovery offers the capacity to automate complex search and data
analysis tasks. This book presents a vast overview of the most
recent developments on techniques and approaches in the field of
biological knowledge discovery and data mining (KDD)--providing
in-depth fundamental and technical field information on the most
important topics encountered.

Written by top experts, Biological Knowledge Discovery
Handbook: Preprocessing, Mining, and Postprocessing of Biological
Data covers the three main phases of knowledge discovery (data
preprocessing, data processing--also known as data
mining--and data postprocessing) and analyzes both verification
systems and discovery systems.

BIOLOGICAL DATA PREPROCESSING

* Part A: Biological Data Management

* Part B: Biological Data Modeling

* Part C: Biological Feature Extraction

* Part D Biological Feature Selection

BIOLOGICAL DATA MINING

* Part E: Regression Analysis of Biological Data

* Part F Biological Data Clustering

* Part G: Biological Data Classification

* Part H: Association Rules Learning from Biological Data

* Part I: Text Mining and Application to Biological Data

* Part J: High-Performance Computing for Biological Data
Mining

Combining sound theory with practical applications in molecular
biology, Biological Knowledge Discovery Handbook is ideal
for courses in bioinformatics and biological KDD as well as for
practitioners and professional researchers in computer science,
life science, and mathematics.



Autorentext

MOURAD ELLOUMI is a Full Professor in Computer Science at the University of Tunis-El Manar, Tunisia. He is the author/coauthor of more than fifty publications in international journals and conference proceedings and the coeditor, along with Albert Zomaya, of Algorithms in Computational Molecular Biology: Techniques, Approaches and Applications (Wiley).

ALBERT Y. ZOMAYA is the Chair Professor of High Performance Computing & Networking at The University of Sydney's School of Information Technologies. He is the author/coauthor of seven books, more than 450 publications in technical journals and conference proceedings, and the editor of fourteen books and nineteen conference volumes. He is a Fellow of the IEEE, the American Association for the Advancement of Science, and IET (UK).



Klappentext

The first comprehensive overview of preprocessing, mining, and postprocessing of biological data

Molecular biology is undergoing exponential growth in both the volume and complexity of biological data—and knowledge discovery offers the capacity to automate complex search and data analysis tasks. This book presents a vast overview of the most recent developments on techniques and approaches in the field of biological knowledge discovery and data mining (KDD)—providing in-depth fundamental and technical field information on the most important topics encountered.

Written by top experts, Biological Knowledge Discovery Handbook: Preprocessing, Mining, and Postprocessing of Biological Data covers the three main phases of knowledge discovery (data preprocessing, data processing—also known as data mining—and data postprocessing) and analyzes both verification systems and discovery systems.

BIOLOGICAL DATA PREPROCESSING

  • Part A: Biological Data Management
  • Part B: Biological Data Modeling
  • Part C: Biological Feature Extraction
  • Part D Biological Feature Selection

BIOLOGICAL DATA MINING

  • Part E: Regression Analysis of Biological Data
  • Part F Biological Data Clustering
  • Part G: Biological Data Classification
  • Part H: Association Rules Learning from Biological Data
  • Part I: Text Mining and Application to Biological Data
  • Part J: High-Performance Computing for Biological Data Mining

Combining sound theory with practical applications in molecular biology, Biological Knowledge Discovery Handbook is ideal for courses in bioinformatics and biological KDD as well as for practitioners and professional researchers in computer science, life science, and mathematics.



Inhalt

PREFACE xiii

CONTRIBUTORS xv

SECTION I BIOLOGICAL DATA PREPROCESSING

PART A: BIOLOGICAL DATA MANAGEMENT

1 GENOME AND TRANSCRIPTOME SEQUENCE DATABASES FOR DISCOVERY, STORAGE, AND REPRESENTATION OF ALTERNATIVE SPLICING EVENTS 5
Bahar Taneri and Terry Gaasterland

2 CLEANING, INTEGRATING, AND WAREHOUSING GENOMIC DATA FROM BIOMEDICAL RESOURCES 35
Fouzia Moussouni and Laure Berti-Equille

3 CLEANSING OF MASS SPECTROMETRY DATA FOR PROTEIN IDENTIFICATION AND QUANTIFICATION 59
Penghao Wang and Albert Y. Zomaya

4 FILTERING PROTEINPROTEIN INTERACTIONS BY INTEGRATION OF ONTOLOGY DATA 77
Young-Rae Cho

PART B: BIOLOGICAL DATA MODELING

5 COMPLEXITY AND SYMMETRIES IN DNA SEQUENCES 95
Carlo Cattani

6 ONTOLOGY-DRIVEN FORMAL CONCEPTUAL DATA MODELING FOR BIOLOGICAL DATA ANALYSIS 129
Catharina Maria Keet

7 BIOLOGICAL DATA INTEGRATION USING NETWORK MODELS 155
Gaurav Kumar and Shoba Ranganathan

8 NETWORK MODELING OF STATISTICAL EPISTASIS 175
Ting Hu and Jason H. Moore

9 GRAPHICAL MODELS FOR PROTEIN FUNCTION AND STRUCTURE PREDICTION 191
Mingjie Tang, Kean Ming Tan, Xin Lu Tan, Lee Sael, Meghana Chitale, Juan Esquivel-Rodrýguez, and Daisuke Kihara

PART C: BIOLOGICAL FEATURE EXTRACTION

10 ALGORITHMS AND DATA STRUCTURES FOR NEXT-GENERATION SEQUENCES 225
Francesco Vezzi, Giuseppe Lancia, and Alberto Policriti

11 ALGORITHMS FOR NEXT-GENERATION SEQUENCING DATA 251
Costas S. Iliopoulos and Solon P. Pissis

12 GENE REGULATORY NETWORK IDENTIFICATION WITH QUALITATIVE PROBABILISTIC NETWORKS 281
Zina M. Ibrahim, Alioune Ngom, and Ahmed Y. Tawfik

PART D: BIOLOGICAL FEATURE SELECTION

13 COMPARING, RANKING, AND FILTERING MOTIFS WITH
CHARACTER CLASSES: APPLICATION TO BIOLOGICAL SEQUENCES ANALYSIS 309
Matteo Comin and Davide Verzotto

14 STABILITY OF FEATURE SELECTION ALGORITHMS AND ENSEMBLE FEATURE SELECTION METHODS IN
BIOINFORMATICS 333
Pengyi Yang, Bing B. Zhou, Jean Yee-Hwa Yang, and Albert Y. Zomaya

15 STATISTICAL SIGNIFICANCE ASSESSMENT FOR BIOLOGICAL FEATURE SELECTION: METHODS AND ISSUES 353
Juntao Li, Kwok Pui Choi, Yudi Pawitan, and Radha Krishna Murthy Karuturi

16 SURVEY OF NOVEL FEATURE SELECTION METHODS FOR CANCER CLASSIFICATION 379
Oleg Okun

17 INFORMATION-THEORETIC GENE SELECTION IN EXPRESSION DATA 399
Patrick E. Meyer and Gianluca Bontempi

18 FEATURE SELECTION AND CLASSIFICATION FOR GENE EXPRESSION DATA USING EVOLUTIONARY COMPUTATION 421
Haider Banka, Suresh Dara, and Mourad Elloumi

SECTION II BIOLOGICAL DATA MINING

PART E: REGRESSION ANALYSIS OF BIOLOGICAL DATA

19 BUILDING VALID REGRESSION MODELS FOR BIOLOGICAL DATA USING STATA AND R 445
Charles Lindsey and Simon J. Sheather

20 LOGISTIC REGRESSION IN GENOMEWIDE ASSOCIATION ANALYSIS 477
Wentian Li and Yaning Yang

21 SEMIPARAMETRIC REGRESSION METHODS IN LONGITUDINAL DATA: APPLICATIONS TO AIDS CLINICAL TRIAL DATA 501
Yehua Li

PART F: BIOLOGICAL D…

Titel
Biological Knowledge Discovery Handbook
Untertitel
Preprocessing, Mining and Postprocessing of Biological Data
EAN
9781118617113
ISBN
978-1-118-61711-3
Format
E-Book (pdf)
Hersteller
Herausgeber
Veröffentlichung
24.12.2013
Digitaler Kopierschutz
Adobe-DRM
Dateigrösse
40.13 MB
Anzahl Seiten
1192
Jahr
2013
Untertitel
Englisch