/tag/data analysis

  • Data Analysis Packages on Ivy Linux VM

    Available Packages
    The following data analysis packages are available on the Ivy Linux Virtual Machines:
    MATLAB
    MATrix LABoratory (MATLAB for short) is software designed for quick scientific calculations, such as matrix manipulation and plotting.
    It has hundreds of built-in functions for a wide variety of computations and several tools designed for specific
    research disciplines, including statistics and partial differential equations.

    • Limited licenses are available. For more information on MATLAB and licensing, please click [here](/userinfo/ivy-linux-sw/data-analysis/matlab).
    SAS
    SAS is a large, platform-independent software package with multiple components, used for statistical analysis, data ETL operations, as well as several other

  • Data Analysis Packages on Ivy Windows VM

    Available Packages
    The following data analysis packages are available on the Ivy Windows Virtual Machines:
    MATLAB
    MATrix LABoratory (MATLAB for short) is software designed for quick scientific calculations, such as matrix manipulation and plotting.
    It has hundreds of built-in functions for a wide variety of computations and several tools designed for specific
    research disciplines, including statistics and partial differential equations.

    • Limited licenses are available. For more information on MATLAB and licensing, please click [here](/userinfo/ivy-windows-sw/data-analysis/matlab).
    SAS
    SAS is a large, platform-independent software package with multiple components, used for statistical analysis, data ETL operations, as well as several other

  • UVa Licensed SPSS on Ivy Windows VM

    SPSS Overview
    SPSS (Statistical Package for the Social Sciences) was initially developed as a social survey project but has since grown to encompass statistical
    applications in almost all disciplines. Different industries use SPSS for their data analysis work. Its features include database management, reporting,
    and graphing, among many others.
    SPSS Usage
    SPSS is available only on the Windows VM at the moment. To run SPSS, go to:
    Start Menu > All Programs > IBM SPSS Statistics
    Licensing
    We have a limited number of SPSS licenses available, which are provided on a first-come, first-served basis.
    As a consequence, availability of SPSS on your VM is not guaranteed once a VM request is submitted.

  • cTAKES on Ivy Linux VMs

    cTAKES Overview
    cTAKES (clinical Text Analysis and Knowledge Extraction System) is a Natural Language Processing (NLP) tool developed at the Mayo Clinic and used to
    extract information from clinical records. It is open source and built on the Apache Unstructured Information Management Architecture. cTAKES
    is modular and expandable, covers a number of generic use cases, and includes excellent best-practice notes.
    cTAKES Usage
    cTAKES components
    Some of the cTAKES components are listed below:
      • Sentence boundary detection (OpenNLP technology)
      • Tokenization (rule-based)
      • Morphologic normalization (NLM’s LVG)
      • POS tagging (OpenNLP technology)
      • Shallow parsing (OpenNLP technology)
      • Named Entity Recognition
      • Negation and context identification (both based on NegEx)
    cTAKES Named Entities
    cTAKES contains the following Named Entities:
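
    To give a feel for what a few of the component stages named in this excerpt do, here is a toy Python sketch of sentence boundary detection, rule-based tokenization, and a NegEx-style negation check. It is only an illustration under simplifying assumptions: it does not use cTAKES, OpenNLP, or the real NegEx rules, and every function name and the short trigger list are hypothetical.

```python
import re

# Hypothetical trigger words; the real NegEx algorithm uses a much larger curated list.
NEGATION_TRIGGERS = ("no", "denies", "without")


def split_sentences(text):
    """Naive sentence boundary detection: split after ., ! or ? followed by whitespace."""
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]


def tokenize(sentence):
    """Rule-based tokenization: runs of word characters, or single punctuation marks."""
    return re.findall(r"\w+|[^\w\s]", sentence)


def is_negated(tokens, term, window=5):
    """Return True if a trigger word occurs within `window` tokens before `term`."""
    lowered = [t.lower() for t in tokens]
    if term.lower() not in lowered:
        return False
    idx = lowered.index(term.lower())
    return any(t in NEGATION_TRIGGERS for t in lowered[max(0, idx - window):idx])


note = "Patient denies chest pain. Patient reports fever."
for sentence in split_sentences(note):
    tokens = tokenize(sentence)
    print(tokens, {term: is_negated(tokens, term) for term in ("pain", "fever")})
```

    In this sketch "pain" is flagged as negated because "denies" precedes it, while "fever" is not. A production pipeline would also handle abbreviations, negation scope termination, and multi-word terms, which this toy version ignores.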

  • cTAKES on Ivy Windows VMs

    cTAKES Overview
    cTAKES (clinical Text Analysis and Knowledge Extraction System) is a Natural Language Processing (NLP) tool developed at the Mayo Clinic and used to
    extract information from clinical records. It is open source and built on the Apache Unstructured Information Management Architecture. cTAKES
    is modular and expandable, covers a number of generic use cases, and includes excellent best-practice notes.
    cTAKES Usage
    cTAKES components
    Some of the cTAKES components are listed below:
      • Sentence boundary detection (OpenNLP technology)
      • Tokenization (rule-based)
      • Morphologic normalization (NLM’s LVG)
      • POS tagging (OpenNLP technology)
      • Shallow parsing (OpenNLP technology)
      • Named Entity Recognition
      • Negation and context identification (both based on NegEx)
    cTAKES Named Entities
    cTAKES contains the following Named Entities:

  • User Licensed Stata on Ivy Linux VMs

    Stata Overview
    Stata is a graphical data analysis tool developed by StataCorp; the name is short for Statistics and Data. It
    is used in various disciplines, including biomedicine, economics, and epidemiology. It is capable
    of performing statistical analysis, simulations, regression, and data management. Besides the standard version,
    Stata is also available in an MP version (multiprocessing) and an SE version for large databases.
    Users requesting an installation of Stata are required to provide their own license. Please consult with us before
    requesting an installation.
    You may also request a Stata license from the UVa Software Gateway.
    Installing programs from SSC
    Please first run the following commands to use the proxy:

  • User Licensed Stata on Ivy Windows VMs

    Stata Overview
    Stata is a graphical data analysis tool developed by StataCorp; the name is short for Statistics and Data. It
    is used in various disciplines, including biomedicine, economics, and epidemiology. It is capable
    of performing statistical analysis, simulations, regression, and data management. Besides the standard version,
    Stata is also available in an MP version (multiprocessing) and an SE version for large databases.
    Users requesting an installation of Stata are required to provide their own license. Please consult with us before
    requesting an installation.
    You may also request a Stata license from the UVa Software Gateway.
    Installing programs from SSC
    Please first run the following commands to use the proxy:

  • UVa Licensed IDL on Ivy Linux VMs

    IDL Overview
    IDL, short for Interactive Data Language, is an interactive, shell-based programming language for data analysis. Widely used in medical imaging, it can quickly create visualizations and graphs
    of large data sets in a few easy steps due to its vector nature. FORTRAN users will find the IDL syntax familiar. IDL is not to be confused with
    Java IDL or Microsoft IDL.
    Basic IDL Usage
    To start IDL, open a terminal window and type idl. This will start the interactive shell.
    Variables in IDL
    To initialize variables in IDL, type:
    <variable_name> = <variable_value>
    For example, type x = 3 and press Enter/Return.

  • UVa Licensed IDL on Ivy Windows VMs

    IDL Overview
    IDL, short for Interactive Data Language, is an interactive, shell-based programming language for data analysis. Widely used in medical imaging, it can quickly create visualizations and graphs
    of large data sets in a few easy steps due to its vector nature. FORTRAN users will find the IDL syntax familiar. IDL is not to be confused with
    Java IDL or Microsoft IDL.
    Licensing
    We have a limited number of IDL licenses available, which are provided on a first-come, first-served basis.
    As a consequence, availability of IDL on your VM is not guaranteed once a VM request is submitted.

  • UVa Licensed MATLAB on Ivy Linux Virtual Machines

    MATLAB Overview
    MATLAB is a high-performance language for technical computing. It integrates
    computation, visualization, and a programming environment. MATLAB stands for MATrix LABoratory. MATLAB was created
    to provide easy access to matrix software developed by the LINPACK (linear system package)
    and EISPACK (eigensystem package) projects. MATLAB includes a programming language
    environment with built-in editing and debugging tools, and supports object-oriented programming.
    Programming in MATLAB
    MATLAB has many advantages compared to conventional computer languages (e.g.,
    C, FORTRAN) for solving technical problems. MATLAB is an interactive system whose
    basic data element is an array, and almost all problems can be solved in MATLAB using that

  • UVa Licensed MATLAB on Ivy Windows Virtual Machines

    MATLAB Overview
    MATLAB is a high-performance language for technical computing. It integrates
    computation, visualization, and a programming environment. MATLAB stands for MATrix LABoratory. MATLAB was created
    to provide easy access to matrix software developed by the LINPACK (linear system package)
    and EISPACK (eigensystem package) projects. MATLAB includes a programming language
    environment with built-in editing and debugging tools, and supports object-oriented programming.
    Programming in MATLAB
    MATLAB has many advantages compared to conventional computer languages (e.g.,
    C, FORTRAN) for solving technical problems. MATLAB is an interactive system whose
    basic data element is an array, and almost all problems can be solved in MATLAB using that

  • UVa Licensed SAS on the Ivy Linux VM

    SAS Overview
    SAS is a command-driven software package used for statistical analysis
    and data visualization. It is available in .
    It is one of the most widely used statistical software packages in both industry and academia.
    You may use it if you have a large number of statistical algorithms. It is not limited to any one industry
    and can be used in both scientific and non-scientific contexts. We only offer the Teaching & Research version
    at the moment.
    Common Usage
    For this example we will use a common scenario from the SAS Clinical Standards Toolkit, which is used for supporting clinical
    research activities.

  • UVa Licensed SAS on the Ivy Windows VM

    SAS Overview
    SAS is a command-driven software package used for statistical analysis
    and data visualization. It is available in .
    It is one of the most widely used statistical software packages in both industry and academia.
    You may use it if you have a large number of statistical algorithms. It is not limited to any one industry
    and can be used in both scientific and non-scientific contexts. We only offer the Teaching & Research version
    at the moment.
    Common Usage
    For this example we will use a common scenario from the SAS Clinical Standards Toolkit, which is used for supporting clinical
    research activities.