Smartphone-Based Recognition of Human Activities and Postural Transitions: Activity recognition data set built from the recordings of 30 subjects performing basic activities and postural transitions while carrying a waist-mounted smartphone with embedded inertial sensors.
SPECT Heart: Data on cardiac Single Photon Emission Computed Tomography (SPECT) images.
Divorce Predictors data set: Participants completed the Personal Information Form and Divorce Predictors Scale.
Parkinson Speech Dataset with Multiple Types of Sound Recordings: The training data belongs to 20 Parkinson's Disease (PD) patients and 20 healthy subjects.
Abstract: Lung cancer data; no attribute definitions.
Diabetes 130-US hospitals for years 1999-2008: This data has been prepared to analyze factors related to readmission as well as other outcomes pertaining to patients with diabetes.
Arrhythmia: Distinguish between the presence and absence of cardiac arrhythmia and classify it in one of the 16 groups.
A soft X-ray technique and GRAINS package were used to construct all seven, real-valued attributes.
I am working on a project to classify lung CT images (cancer/non-cancer) using a CNN model; for that I need a free dataset with an annotation file.
Tags: ... , lung, lung cancer, nsclc, stem cell.
1992-05-01.
Breath Metabolomics: Breath analysis is a pivotal method for biological phenotyping.
There may be multiple rows per patientId.
This file contains a list of risk factors for cervical cancer leading to a biopsy examination.
The dataset contains one record for each of the approximately 155,000 participants in the PLCO trial.
This dataset is related to classification and predictive tasks.
QSAR Bioconcentration classes dataset: Dataset of manually-curated Bioconcentration factor (BCF, fish) and mechanistic classes for QSAR modeling.
Lymphography: This lymphography domain was obtained from the University Medical Centre, Institute of Oncology, Ljubljana, Yugoslavia.
Without low-dose spiral computed tomography (LDCT) screening, lung cancer is usually not found until a person develops symptoms, when the disease is more advanced and more difficult to treat.
Anuran Calls (MFCCs): Acoustic features extracted from syllables of anuran (frog) calls, including the family, the genus, and the species labels (multilabel).
Thyroid Disease: 10 separate databases from Garvan Institute.
Breast Cancer Wisconsin (Diagnostic): Diagnostic Wisconsin Breast Cancer Database.
... [15], it is aimed to classify tumor and normal cells for diagnostic purpose; while in the lung cancer data set [9], it is aimed to differentiate two types of disease.
chipseq: ChIP-seq experiments characterize protein modifications or binding at specific genomic locations in specific samples.
Echocardiogram: Data for classifying if patients will survive for at least one year after a heart attack.
Amphibians: The dataset is a multilabel classification problem.
Parkinson Dataset with replicated acoustic features: Contains acoustic features extracted from 3 voice recording replications of the sustained /a/ phonation for each one of the 80 subjects (40 of them with Parkinson's Disease).
Sperm concentration is related to socio-demographic data, environmental factors, health status, and life habits.
... been collected retrospectively during the years 2007-2011.
Lung Cancer: Lung cancer data; no attribute definitions.
Used to predict heavy drinking episodes via mobile data.
MicroMass: A dataset to explore machine learning approaches for the identification of microorganisms from mass-spectrometry data.
It visualizes the data in 3D and trains a 3D convolutional network on the data …
Data Set Characteristics: Multivariate.
Donor: David W. Aha (aha '@' ics.uci.edu) (714) 856-8779
Data Set Information: This database contains 76 attributes, but all published experiments refer to using a subset of 14 of them.
Aim: assess whether voice rehabilitation treatment leads to phonations considered 'acceptable' or 'unacceptable' (binary class classification problem).
Arcene: ARCENE's task is to distinguish cancer versus normal patterns from mass-spectrometric data.
Cervical Cancer Risk Factors for Biopsy: This dataset was obtained from the UCI Repository, which is kindly acknowledged.
Nuclear feature extraction for breast tumor diagnosis.
sEMG for Basic Hand movements: The sEMG for Basic Hand movements includes 2 databases of surface electromyographic signals of 6 hand movements using Delsys' EMG System.
Breast Cancer: Breast Cancer Data (Restricted Access).
Number of Attributes: 56.
Codon usage: DNA codon usage frequencies of a large sample of diverse biological organisms from different taxa.
Further data set titles: Molecular Biology (Promoter Gene Sequences); Molecular Biology (Protein Secondary Structure); Molecular Biology (Splice-junction Gene Sequences); KEGG Metabolic Relation Network (Directed); KEGG Metabolic Reaction Network (Undirected); One-hundred plant species leaves data set; Reuters RCV1 RCV2 Multilingual, Multiview Text Categorization Test collection; Tamilnadu Electricity Board Hourly Readings; Diabetes 130-US hospitals for years 1999-2008; Parkinson Speech Dataset with Multiple Types of Sound Recordings; Smartphone-Based Recognition of Human Activities and Postural Transitions; Quality Assessment of Digital Colposcopies; Early biomarkers of Parkinson's disease based on natural connected speech; Autistic Spectrum Disorder Screening Data for Children; Autistic Spectrum Disorder Screening Data for Adolescent; Activity recognition with healthy older people using a batteryless wearable sensor; Simulated Falls and Daily Living Activities Data Set; EEG Steady-State Visual Evoked Potential Signals; Parkinson Dataset with replicated acoustic features; Hepatitis C Virus (HCV) for Egyptian patients; Shoulder Implant X-Ray Manufacturer Classification; Estimation of obesity levels based on eating habits and physical condition; Activity recognition using wearable physiological measurements.
The lung cancer specialists at UCI Health provide complete diagnostic testing and leading-edge treatments for patients with lung cancer and other diseases and disorders affecting the lungs and airways.
pranath / breast_cancer_prediction: In this project I will look at a dataset of patient data relating to breast cancer, and develop a machine learning model that will aim to predict Malignant …
Audiology (Original): Nominal audiology dataset from Baylor.
The Lung dataset is a comprehensive dataset that contains nearly all the PLCO study data available for lung cancer screening, incidence, and mortality analyses.
Extension of Z-Alizadeh Sani dataset: It was collected for CAD diagnosis.
Wilt: High-resolution Remote Sensing data set (Quickbird).
Forest type mapping: Multi-temporal remote sensing data of a forested area in Japan.
Breast cancer predictions using UCI's Breast Cancer Wisconsin dataset.
Daphnet Freezing of Gait: This dataset contains the annotated readings of 3 acceleration sensors at the hip and leg of Parkinson's disease patients that experience freezing of gait (FoG) during walking tasks.
HCV data: The data set contains laboratory values of blood donors and Hepatitis C patients and demographic values like age.
ILPD (Indian Liver Patient Dataset): This data set contains 10 variables: age, gender, total Bilirubin, direct Bilirubin, total proteins, albumin, A/G ratio, SGPT, SGOT and Alkphos.
SCADI: First self-care activities dataset based on ICF-CY.
Hepatitis C Virus (HCV) for Egyptian patients: Egyptian patients who underwent treatment dosages for HCV for about 18 months.
Street, W.H.
We use Table 4 to summarize the background information of 6 data sets for the subtype classification of the childhood leukemia disease.
One-hundred plant species leaves data set: Sixteen samples of leaf each of one-hundred plant species.
Parkinsons: Oxford Parkinson's Disease Detection Dataset.
The cancer center’s success with investigative drugs to block cancer-related genetic mutations is well known.
Papers were automatically harvested and associated with this data set, in collaboration with Rexa.info.
Soybean (Small): Michalski's famous soybean disease database.
Mushroom: From Audubon Society Field Guide; mushrooms described in terms of physical characteristics; classification: poisonous or edible.
This is a two-class classification problem with continuous input variables.
Refractive errors: Effect of lifestyle and genetics on eye refractive errors.
Localization Data for Person Activity: Data contains recordings of five people performing different activities.
21 datasets were created from 12 bioassays.
Dermatology: The aim for this dataset is to determine the type of Erythemato-Squamous Disease.
Expression data from human lung cancer cell line NCI-H292 (Submitter supplied): The CT45 family is abnormally overexpressed in various types of cancer.
Neural Network - **Hyperparameters tuning**: single parameter trainer mode; fully connected perceptron; 200 perceptron; learning rate - 0.001; learning iterations - 200; initial learning weights - 0.1; min-max normalizer; shuffled …
Computer-Aided Diagnosis & Therapy, Siemens Medical Solutions, Inc. ... including a medical dataset on detection of lung cancer from medical images.
Hybrid Search of Feature Subsets.
Let’s say you are interested in the samples 10, 50, and 85, and want to know their class name.
gene expression cancer RNA-Seq: This collection of data is part of the RNA-Seq (HiSeq) PANCAN data set; it is a random extraction of gene expressions of patients having different types of tumor: BRCA, KIRC, COAD, LUAD and PRAD.
Exasens: This repository introduces a novel dataset for the classification of 4 groups of respiratory diseases: Chronic Obstructive Pulmonary Disease (COPD), asthma, infected, and Healthy Controls (HC).
EEG Eye State: The data set consists of 14 EEG values and a value indicating the eye state.
From all subjects, multiple types of sound recordings (26) are taken.
Mangasarian.
HIV-1 protease cleavage: The data contains lists of octamers (8 amino acids) and a flag (-1 or 1) depending on whether HIV-1 protease will cleave in the central position (between amino acids 4 and 5).
Testing data set from stratified random sample of image.
Thoracic Surgery for Lung Cancer Data Set.
Bar Crawl: Detecting Heavy Drinking: Accelerometer and transdermal alcohol content data from a college bar crawl.
Acute Inflammations: The data was created by a medical expert as a data set to test the expert system, which will perform the presumptive diagnosis of two diseases of the urinary system.
Yeast: Predicting the Cellular Localization Sites of Proteins.
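The HIV-1 protease cleavage entry above describes each record as an octamer plus a flag of -1 or 1, with the possible cleavage site between amino acids 4 and 5. A minimal sketch of reading one such record follows. It assumes a comma-separated `OCTAMER,FLAG` layout, which the listing does not state, and the names `CleavageRecord` and `parse_octamer_line` as well as the sample line are made up for illustration.

```python
from typing import NamedTuple


class CleavageRecord(NamedTuple):
    left: str      # amino acids 1-4, before the candidate cleavage site
    right: str     # amino acids 5-8, after the candidate cleavage site
    cleaved: bool  # True when the flag is 1, False when it is -1


def parse_octamer_line(line: str) -> CleavageRecord:
    """Parse one 'OCTAMER,FLAG' line (assumed layout) into a record."""
    octamer, flag_text = line.strip().split(",")
    if len(octamer) != 8:
        raise ValueError(f"expected 8 amino acids, got {len(octamer)}")
    flag = int(flag_text)
    if flag not in (-1, 1):
        raise ValueError(f"flag must be -1 or 1, got {flag}")
    # Split at the central position, between amino acids 4 and 5.
    return CleavageRecord(octamer[:4], octamer[4:], flag == 1)


# Made-up sample line, used only to exercise the parser.
example = parse_octamer_line("AAAKFERQ,-1")
print(example)
```

Keeping the two halves separate makes it easy to build position-specific features on either side of the cleavage site; a real loader would need to be checked against the actual file layout.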
EEG Steady-State Visual Evoked Potential Signals: This database consists of 30 subjects performing Brain Computer Interface for Steady State Visual Evoked Potentials (BCI-SSVEP).
Drug Review Dataset (Drugs.com): The dataset provides patient reviews on specific drugs along with related conditions and a 10 star patient rating reflecting overall patient satisfaction.
Associated Tasks: Classification.
DICOM Images.
The ability to convert SVMs and other "black-box" classifiers into a set of human-understandable rules is critical not only for physician ...
Using Rules to Analyse Bio-medical Data: A Comparison between C4.5 and PCL.
Rule extraction from Linear Support Vector Machines.
Healthy subjects conducted six daily life grasps.
It focuses on characteristics of the cancer, including information not available in the Participant dataset.
Reuters RCV1 RCV2 Multilingual, Multiview Text Categorization Test collection: This test collection contains feature characteristics of documents originally written in five different languages and their translations, over a common set of 6 categories.
Iris: Famous database; from Fisher, 1936.
Heart Disease: 4 databases: Cleveland, Hungary, Switzerland, and the VA Long Beach.
Note that there is also a related Breast Cancer Wisconsin (Original) Data Set with a different set of…
Zoo: Artificial, 7 classes of animals.
Activity recognition using wearable physiological measurements: This dataset contains features from Electrocardiogram (ECG), Thoracic Electrical Bioimpedance (TEB) and the Electrodermal Activity (EDA) for activity recognition.
We are working to ensure that future patients have better diagnostic and treatment options, as well as access to therapies that improve quality of life and offer relief from pain and discomfort.
Visualising and exploring Breast Cancer data set to predict cancer.
Current dataset was adapted to ARFF format from the UCI version.
Cardiotocography: The dataset consists of measurements of fetal heart rate (FHR) and uterine contraction (UC) features on cardiotocograms classified by expert obstetricians.
Horse Colic: Well documented attributes; 368 instances with 28 attributes (continuous, discrete, and nominal); 30% missing values.
Abalone: Predict the age of abalone from physical measurements.
Nasarian CAD Dataset: This dataset comprises records of 150 subjects (all male employees in Iran who visited the Abadan Occupational (Industrial) Medicine Clinic) and 52 features.
Quality Assessment of Digital Colposcopies: This dataset explores the subjective quality assessment of digital colposcopies.
Epileptic Seizure Recognition: This dataset is a pre-processed and re-structured/reshaped version of a very commonly used dataset featuring epileptic seizure detection.
Each person wore four sensors (tags) while performing the same scenario five times.
Molecular Biology (Promoter Gene Sequences): E. coli promoter gene sequences (DNA) with partial domain theory.
sid0407 / LungCT_Diagnosis: This repository processes CT scan images of human lungs available in DICOM image format.
This dataset is one of 5 datasets of the NIPS 2003 feature selection challenge.
Paul R. Kennedy.
Breast Tissue: Dataset with electrical impedance measurements of freshly excised tissue samples from the breast.
seeds: Measurements of geometrical properties of kernels belonging to three different varieties of wheat.
Source Information: Data was published in: Hong, Z.Q.
Immunotherapy Dataset: This dataset contains information about wart treatment results of 90 patients using immunotherapy.
Hong, Z.Q. and Yang, J.Y. Optimal Discriminant Plane for a Small Number of Samples and Design Method of Classifier on the Plane, Pattern Recognition, Vol.
Below are papers that cite this data set, with context shown.
This data will help inform research, policy, planning and guideline development work in the breast cancer area.
Cervical Cancer Behavior Risk: The dataset contains 19 attributes regarding ca cervix behavior risk; the class label is ca_cervix, with values 1 and 0 meaning respondents with and without ca cervix, respectively.
For datasets with a large N and a substantially large M, such as the Splice dataset, FocusM takes many hours to terminate.
Shoulder Implant X-Ray Manufacturer Classification: 597 de-identified raw X-ray scans of implanted shoulder prostheses from four manufacturers.
UCI non-small cell lung cancer study highlights advances in targeted drug therapy.
This dataset is taken from OpenML - breast-cancer.
At UCI Health, our lung cancer specialists are actively involved in multiple research projects.
Breast Cancer Wisconsin (Prognostic): Prognostic Wisconsin Breast Cancer Database.
Post-Operative Patient: Dataset of patient features.
Contraceptive Method Choice: Dataset is a subset of the 1987 National Indonesia Contraceptive Prevalence Survey.
Missing Values? Yes.
About 11,000 new cases of invasive cervical cancer are diagnosed each year in the U.S.
November 5, 2010: “We hope to be able to identify specific genetic changes in lung cancer and treat patients with specific inhibitors that can improve survival rates and quality of life,” says Dr. Ignatius Ou of UC Irvine Medical Center.
Breast Cancer Coimbra: Clinical features were observed or measured for 64 patients with breast cancer and 52 healthy controls.
Bone marrow transplant: children: The data set describes pediatric patients with several hematologic diseases, who were subject to the unmanipulated allogeneic unrelated donor hematopoietic stem cell transplantation.
The data set would allow common, consistent and high quality breast cancer data to be collected by State and Territory cancer registries and collated nationally.
Manoranjan Dash and Huan Liu.
Estimation of obesity levels based on eating habits and physical condition: This dataset includes data for the estimation of obesity levels in individuals from the countries of Mexico, Peru and Colombia, based on their eating habits and physical condition.
PubChem Bioassay Data: These highly imbalanced bioassay datasets are from the differing types of screening that can be performed using HTS technology.
From UCI Machine Learning Repository.
SPECTF Heart: Data on cardiac Single Photon Emission Computed Tomography (SPECT) images.
>>> from sklearn.datasets import load_breast_cancer
>>> data = load_breast_cancer()
>>> data. …
Ecoli: This data contains protein localization sites.
The Jupyter notebook ThoracicSurgery contains the main code.
Thoracic_Surgery_Presentation contains the PowerPoint slides presentation.
Thoracic_Surgery_Report contains the project report.
Heart failure clinical records: This dataset contains the medical records of 299 patients who had heart failure, collected during their follow-up period, where each patient profile has 13 clinical features.
I have used different algorithms.
Variety of graphical features presented.
Sample code IDs were removed.
PRICAI.
KEGG Metabolic Reaction Network (Undirected): KEGG Metabolic pathways modeled as undirected reaction network.
Autistic Spectrum Disorder Screening Data for Children: Children screening data for autism suitable for classification and predictive tasks.
Algerian Forest Fires Dataset: The dataset includes 244 instances that regroup data from two regions of Algeria.
In addition, there are 198 patients used for the test set.
There is also a binary target column, Target, indicating pneumonia or non-pneumonia.
Each patient is classified into two categories: normal and abnormal.
This breast cancer domain was obtained from the University Medical Centre, Institute of Oncology, Ljubljana, Yugoslavia.
KEGG Metabolic Relation Network (Directed): KEGG Metabolic pathways modeled as directed relation network.
UCI Health offers an innovative lung cancer screening program that can detect lung cancer at its earliest stage, when it is most treatable.
p53 Mutants: The goal is to model mutant p53 transcriptional activity (active vs inactive) based on data extracted from biophysical simulations.
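The binary `Target` column mentioned above, together with the earlier note that there may be multiple rows per `patientId`, means that row-level labels have to be collapsed before counting patients. The sketch below shows one way to do that with the standard library. Only `patientId` and `Target` come from the text; the other column names and every value in `SAMPLE` are invented for illustration.

```python
import csv
import io
from collections import defaultdict

# Made-up annotation rows: patient p1 has two rows, patient p2 has one.
SAMPLE = """patientId,x,y,width,height,Target
p1,100,150,50,80,1
p1,300,200,40,60,1
p2,,,,,0
"""


def patient_labels(csv_text: str) -> dict:
    """Collapse per-row labels into one label per patient.

    A patient is labelled 1 if any of their rows has Target == 1.
    """
    labels = defaultdict(int)
    for row in csv.DictReader(io.StringIO(csv_text)):
        patient = row["patientId"]
        labels[patient] = max(labels[patient], int(row["Target"]))
    return dict(labels)


labels = patient_labels(SAMPLE)
print(labels)
```

Counting rows instead of patients would overstate the positive class whenever a positive patient carries several annotations, which is why the aggregation is done per `patientId`.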
This is one of 5 datasets of the NIPS 2003 feature selection challenge.
However, the molecular mechanisms of the oncogenic CT45 to trigger carcinogenesis and tumor malignant progression are largely enigmatic.
Quadruped Mammals: The file animals.c is a data generator of structured instances representing quadruped animals.
Cancer Datasets: Datasets are collections of data.
Number of Instances: 32.
Early biomarkers of Parkinson's disease based on natural connected speech: Predict a pattern of neurodegeneration in the dataset of speech features obtained from patients with early untreated Parkinson’s disease and patients at high risk of developing Parkinson’s disease.
The data I am going to use to explore feature selection methods is the Breast Cancer Wisconsin (Diagnostic) Dataset: W.N.
Discretization should be applied based on expert recommendations; an attached file shows how.
Lung Cancer Screening Services: Lung cancer, the leading cause of U.S. cancer deaths, is best treated when detected early, well before symptoms appear.
The features cover demographic information, habits, and historic medical records.
Attribute Characteristics: Integer.
Small number of training samples of diseased trees, large number for other land cover.
The aim is to ensure that the datasets produced for different tumour types have a consistent style and content, and contain all the parameters needed to guide management and prognostication for individual cancers.
Diabetic Retinopathy Debrecen Data Set: This dataset contains features extracted from the Messidor image set to predict whether an image contains signs of diabetic retinopathy or not.
Area: Life.
LSVT Voice Rehabilitation: 126 samples from 14 participants, 309 features.
Number of Web Hits: 324188.
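The metadata listed above for the lung cancer data (32 instances, integer attributes, missing values present) suggests a simple first check: count how many values are missing in each column. The sketch below assumes comma-separated rows with `?` as the missing-value marker, which is a common convention but is not stated in this listing; the rows in `SAMPLE_ROWS` are made up.

```python
def parse_row(line: str, missing: str = "?") -> list:
    """Split a comma-separated row into ints, mapping the missing marker to None."""
    return [None if field.strip() == missing else int(field)
            for field in line.strip().split(",")]


def missing_per_column(rows: list) -> list:
    """Count None entries at each column position."""
    width = len(rows[0])
    return [sum(row[i] is None for row in rows) for i in range(width)]


# Made-up rows, only to exercise the two helpers.
SAMPLE_ROWS = ["1,0,3,?,2", "2,1,?,?,3", "3,0,2,1,1"]
rows = [parse_row(r) for r in SAMPLE_ROWS]
counts = missing_per_column(rows)
print(counts)
```

With only 32 instances, dropping every row that has a gap can remove a noticeable share of the data, so seeing the per-column counts first helps decide between dropping columns and imputing values.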
Simulated Falls and Daily Living Activities Data Set: 20 falls and 16 daily living activities were performed by 17 volunteers with 5 repetitions (3,060 instances) while wearing 6 sensors attached to their head, chest, waist, wrist, thigh and ankle.
Molecular Biology (Splice-junction Gene Sequences): Primate splice-junction gene sequences (DNA) with associated imperfect domain theory.
Autistic Spectrum Disorder Screening Data for Adolescent: Autistic Spectrum Disorder Screening Data for Adolescent.
For each sample, a shape descriptor, fine scale margin and texture histogram are given.
Cervical cancer (Risk Factors): This dataset focuses on the prediction of indicators/diagnosis of cervical cancer.
The Lung Cancer dataset (~2,100, one record per lung cancer) contains information about each lung cancer diagnosed during the trial, including multiple primary tumors in the same individual.
The copy of UCI ML Breast Cancer Wisconsin (Diagnostic) dataset is downloaded from: https://goo.gl/U2Uwz2.
Wolberg and O.L.
Breast Cancer Wisconsin (Original): Original Wisconsin Breast Cancer Database.
The goal is to predict the presence of amphibian species near the water reservoirs based on features obtained from GIS systems and satellite images.
Z-Alizadeh Sani: It was collected for CAD diagnosis.
... that is also available in UCI datasets.
UCI Health Chao Family Comprehensive Cancer Center now offers special lung cancer screening for those at high risk for developing the disease, including smokers and people exposed to asbestos and other cancer-causing substances.
Lung cancer treatment: The UCI Chao Family Comprehensive Cancer Center is a leader in first-in-human trials of targeted therapies for non-small-cell lung cancer for good reason.
Tags: cancer, lung, lung cancer, saliva.
Expression profile of lung adenocarcinoma, A549 cells following targeted depletion of non metastatic 2 (NME2/NM23 H2).
Audiology (Standardized): Standardized version of the original audiology database.
... and registered in the Polish National Cancer Registry.
The goal is to map different forest types using spectral data.
Chemical compounds represented by structural molecular features must be classified as active (binding to thrombin) or inactive.
What should I expect the data format to be?
Statlog (Heart): This dataset is a heart disease database similar to a database already present in the repository (Heart Disease databases) but in a slightly different form.
Demospongiae: Marine sponges of the Demospongiae class classification domain.
Term Project on LIDC (Lung Cancer CT Scan) dataset.
Mammographic Mass: Discrimination of benign and malignant mammographic masses based on BI-RADS attributes and the patient's age.
O.L.
Title: Lung Cancer Data.
Primary Tumor: From Ljubljana Oncology Institute.
Cryotherapy Dataset: This dataset contains information about wart treatment results of 90 patients using cryotherapy.
Molecular Biology (Protein Secondary Structure): From CMU connectionist bench repository; classifies secondary structure of certain globular proteins.
Jinyan Li and Limsoon Wong.
I am working on a seminar in the Soft Computing domain; the topic is Early Diagnosis of Lung Cancer.
Caesarian Section Classification Dataset: This dataset contains information about caesarian section results of 80 pregnant women with the most important characteristics of delivery problems in the medical field.
This is a dataset about breast cancer occurrences.
Geo-Magnetic field and WLAN dataset for indoor localisation from wristband and smartphone: Multivariate, Sequential, Time-Series; Classification, Regression, Clustering.
Thanks go to M. Zwitter and M. Soklic for providing the data.
Actually, several reasons.
In a pilot study, 100 experiments with four subjects have been performed to study the reproducibility of this technique.
Bounding boxes are defined as follows: x-min y-min width height.
Haberman's Survival: Dataset contains cases from study conducted on the survival of patients who had undergone surgery for breast cancer.
Activity recognition with healthy older people using a batteryless wearable sensor: Sequential motion data from 14 healthy older people aged 66 to 86 years old using a batteryless, wearable sensor on top of their clothing for the recognition of activities in clinical environments.
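The bounding-box convention noted above (x-min y-min width height) differs from the corner format (x-min, y-min, x-max, y-max) that many plotting and overlap routines expect. A minimal conversion sketch follows; the function names and the sample box string are hypothetical, and it assumes the four values are whitespace-separated numbers.

```python
def to_corners(x_min: float, y_min: float, width: float, height: float) -> tuple:
    """Convert (x-min, y-min, width, height) to (x-min, y-min, x-max, y-max)."""
    return (x_min, y_min, x_min + width, y_min + height)


def parse_box(text: str) -> tuple:
    """Parse a whitespace-separated 'x-min y-min width height' string."""
    x_min, y_min, width, height = (float(v) for v in text.split())
    return to_corners(x_min, y_min, width, height)


def box_area(corners: tuple) -> float:
    """Area of a box given in corner format."""
    x_min, y_min, x_max, y_max = corners
    return (x_max - x_min) * (y_max - y_min)


# Made-up box string, only to exercise the helpers.
corners = parse_box("264 152 213 379")
print(corners, box_area(corners))
```

Converting once at load time keeps later code, such as intersection-over-union checks, from mixing the two conventions.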