webapp logo

DAMEWARE

Web Application REsource of DAME

This page is the entry point to the DAMEWARE (Web Application REsource of DAME) (beta release) specialized for data mining on massive data sets. It is a toolset of machine learning models to manage and explore data in various formats.
In this page the users can obtain news, documentation, dataset samples and technical support about the web application.



Release Notes

  • The beta release is an evolution of the alpha version, tested few months ago by a selected team to verify main features and to evaluate their performances. So far, main efforts have been focused to integrate suggestions and issues coming from the alpha test users.
  • In this release has been increased the number of machine learning models: a double version of the Multi Layer Perceptron with Back Propagation (MLP) or Genetic Algorithms (MLPGA) as learning engines and Support Vector Machines (SVM) with several kernels, usable to perform experiments, such as crispy or multiple classification and non-linear regression.
  • Data manipulation features as well as navigation through application workspaces and tags have been revised in order to be more reliable and easy to use.
  • The menu options have been completely renewed, providing more interesting topics about the application.
  • New users have now an easy and safe registration procedure, in order to obtain through e-mail their personal access account in an automatic way.



User Documentation Package

back to top page



Test Resources

This information is for users who intend to contribute to test the beta release.

Below there is a double version of the test report document, to be used by testing people. Please follow instructions inside to report your own tests to the developers.

The following are some data files you can use during experiments for Training, Test or Run cases:

  • xor.csv (CSV format, 2 input columns + target column), usable as Training/Test input file;
  • xor_run.csv (CSV format, 2 input columns), usable as Run input file;
  • test.dat (ASCII format, 4 input columns), usable as Run input file;
  • train.dat (ASCII format, 5 columns, 4 input + 1 target), usable as Training/Test input file;
  • train.fits (FITS format, 5 columns, same of train.dat), usable as Training/Test input file;
  • test.fits (FITS format, 4 columns, same of test.dat), usable as Run input file;
  • train.csv (CSV format, 5 columns, same of train.dat), usable as Training/Test input file;
  • train.votable (VOTable format, 5 columns, same of train.dat), usable as Training/Test input file;
  • dataset_training.dat (ASCII format, 5 columns, 4 input + 1 target), usable as Training/Test input;
  • dataset_run.dat (ASCII format, 4 columns), usable as Run input;
  • dataset_training.fits (FITS format, same of dataset_training.dat), usable as Training/Test input;
  • dataset_run.fits (FITS format, same of dataset_run.dat), usable as Run input;

Moreover, following links can be useful to collect and extract generic datasets for both classification or regression experiments:

back to top page



Frequently Asked Questions

Work in progress!

  1. How can I have the access to the web application?
    • The beta release provides a registration procedure, which consists of a form to be filled in and sent to the administrator. Immediately after you will receive a welcome message to your specified e-mail address and after (within max 36 hours) another message with the confirmation of your registration and related private info to access the application.
  2. Sometimes, after operations in new tabs, coming back in the Resource Manager, the previously selected workspace is no more highlighted. Is it normal?
    • yes. In this cases you have to re-select the Workspace to see its contents. It is due to internal refresh mechanisms.
  3. Which is the max length for workspace's name?
    • 15 characters. No spaces and special chars are allowed
  4. Is it possible to create two workspaces with the same name?
    • No, it isn't! Workspaces belonging to the same user must always have different names.
  5. In order to operate on dataset editing, is it possible to perform multiple edit options before to save the final dataset file?
    • No!, The dataset modification can be executed step by step. A sequence of editing options is allowed, but performed one at a time. Each time you apply an option to modify the dataset, a new file is created and stored in the Workspace file list with the suffix recalling the selected option. If you want to make another modification you have to load the stored new dataset file for the next editing.
  6. If I operate with DAME application with Firefox browser, the cursor of the textbox for the "Upload file from URI" or "Upload file from HD", seems to be blocked.
    • Yes, sometimes, if you try to move the cursor along the textbox, it doesn't work! It is an internal problem of Firefox browser. There's nothing we can do.
  7. How can I cancel a previously created workspace?
    • In order to completely cancel a workspace, you have preliminarly empty the related experiment and file lists, otherwise the system will not allow to cancel it.
  8. Is it possible to move data files from one Workspace to another?
    • NOT Yet! Currently it is possible to move (copy) output files of an experiment into the input file list of the same Workspace. This is one of the future improvements foreseen in the next releases. The alternative procedure could be to download on user local Hard Disk the file and to upload it in the webapp in the file list of the desired workspace.
  9. Is it possibile to use and handle data files in ARFF format (.ARFF)?
    • In principle yes. But in some cases there could be a problem that causes the experiment failure. The reason is because ARFF files are not between the standard supported file types. In the next release it will be.
  10. -

back to top page



Who is who in the DAMEWARE development

  • Massimo Brescia (Project Management & Data Mining Models)
  • Stefano Cavuoti (Project Engineering & Data Mining Models)
  • Raffaele D'Abrusco (Science Support)
  • Giovanni d'Angelo (Driver)
  • Alessandro Di Guido (Data Mining Models)
  • Michelangelo Fiore (Framework)
  • Mauro Garofalo (Project Engineering, GUI & Infrastructure)
  • Omar Laurino (Project Engineering)
  • Giuseppe Longo (PI & Science Support)
  • Francesco Manna (Front End)
  • Alfonso Nocella (Project Engineering, Framework, Database & Infrastructure)
  • Bojan Skordovski (Data Mining Models)

back to top page




GO to DAMEWARE


News:

The last test session of the beta release (before the public release) has started!

More news in the next few days...

Click below to access to the DAMEWARE application

GO


The user documentation is now available on this page.

Check periodically for updates!


Resources:


matrix

Drawing of a full cylinder

Leonardo da Vinci

De Divina Proportione, Luca Pacioli, Milan, 1497



Science Cases
[+] Photometric redshifts

[+] Photometric Quasar candidates

[+] Globular Clusters search

[+] Transient classification

[+] Image segmentation

Functionalities
[+] Classification

[+] Regression

Machine Learning Models
[+] MLP with BP

[+] MLP with GAs

[+] SVM

Infrastructure
[+] Overview

[+] Cloud Environment

[+] Grid Environment

Software Overview
[+] GUI & Front End

[+] Framework

[+] Database

[+] Model Libraries

[+] Driver

Technical Support
[+] helpdame AT gmail.com

[+] Skype service Skype Me™!