Learning NoSQL

The emergence of twitter, facebook, Groupon, amazon, etc ... is becoming increasingly popular new forms of large data storage and fast access to them. The terms NoSQL , Big Data , Hadoop ... are rapidly becoming popular.

Not long ago we talked about it:

 

- Comparison of NoSQL Databases ( Cassandra , MongoDB , CouchDB , Redis , Riak and HBase ) 
- Infographics, the potential for big data . 
- Pentaho also is accelerating much its integration with Big Data.

 

Today we give you some more detailed presentations, where we can research:  

Aspects to evaluate the selection of an ETL tool

Addressing a business intelligence project is important to proper assessment of the ETL tool that we will use. The tool based on which we will implement our procurement processes of the Datamart, Datawarehouse or storage structure based on which further exploit the data. It is a cornerstone for the design, construction and subsequent evolution of our BI system. We will analyze technical issues only, without entering into the economic aspects or otherwise (licenses, agreements, technical support, tool changes, etc. ..) . Note that the ETL processes, are closely linked to the processes of data profiling and data quality, here we will not consider.

Translation of terminology Oracle - DB2 LUW

With 9.7 DB2 LUW version, IBM makes a nod to all Oracle DBAs, much more numerous in the DB2 market.

For this reason, 9.7 version has introduced Oracle compatibility modes that let you perform tasks in DB2 with the ease and knowledge that all Oracle DBAs have. However, it is important to know the terminology's translation between Oracle and DB2 if you intend to get into the DB2 world.

In this first article, I relate a number of items from which this introduction is simple and can be read DB2 documentation easily, including general terminology, updates, utilities, and views.

 

Encrypted Tablespaces in Oracle 11g

Since the release 1 of Oracle 11g, Oracle provides the ability to encrypt tablespaces in full, to protect sensitive data inside and accessible from the OS. That is, the objective of this new functionality is not to protect sensitive data of users of the database, but to protect the information of the tablespace datafiles.

 

To explain the usefulness of this feature, it is best to explain situations in which our data without this functionality would be vulnerable. For example, in the case file of a physical backup of a database tablespace were to wrong hands, could see some data "clear" without problems. For example, a single edition of tablespace (or a simple "cat") containing the Employees table, we show clear varchar2 fields can extract sensitive data (you do not believe, try it!).

AWR Formatter

Anyone who usually look at AWR performance reports to analyze Oracle performance problems, often have their own reading process and approach to all data that shows this report, but I always missed a tool that make easier reading all data.

 

In-database analytical solutions. Case SAS - Teradata

Within SAS solutions focused on improving performance and response time analytic environments highlights the effort with the proposed SAS In-Database. This proposal is accompanied by other high-performance solutions such as Grid Computing and SAS In-memory, which also improve performance, help manage the workload and improve the scalability of analytic environments.

 

Help in the definition of KPI's. Library use.

One of the key phases in any Business Intelligence project and in particular the definition of Dashboard is the identification of KPIs (Key Performance Indicators). These metrics that quantify objectives are closely linked to business processes that measure, but there are standard libraries that proposed KPI's and that can help us to define it better. 

 

One of them is: https://www.kpilibrary.com/

 

This library classifies KPI's by sector and business process. Logically, it is necessary to land and adapt to specific business process and performance targets to be achieved at estrategical level, but can be of great help and give ideas to propose indicators derived from those proposed.

 

Example:

 

Within the area of ​​human resources and in particular the formation processes, proposes more than 30 KPIs, as shown:

 

      Average number of training hours per employee

      Average costs per employee training

      Percentage of on-line courses

      Percentage of budget dedicated HR to training

Do your BI platform support the data source you want to analyze?

Although the latest versions of most BI platforms support a wide range of data sources, this is a common question which involved the version of your BI tool, the version of the database file format or ERP that acts as a data source and the operating system.

 

In the case of SAS can solve these questions by referring to the SAS / ACCESS Validation Matrix, selecting the version of SAS, the database, the operating system and you have the answer.

SQL08: Create a conditioned table using the data dictionary of SQL Server

Sometimes we need to check the existence of a table in a script or scheduled task to record error logs, first executions etc ... 

Let us take an example, a package of integration services that normally distribute or run there where we go and that leaves traces in a custom table that is not the default for loading logs. We could always include a performance task or sql script, run right or wrong, as the first to run on the package and then continue. Being purists this just is not quite careful