Data Governance Policy

What is a Data Governance Policy?

A data governance policy is concerned with how an organization collects, stores, accesses, and maintains its data. Because data is now a core enterprise asset, ensuring it is properly maintained and controlled is critical. When creating a data governance program, there are at least four core areas to consider: Data …

Text Mining Loch Ness Monster Sightings

Text Mining with RapidMiner for Loch Ness Monster Sightings

Text mining involves pulling root words from text in a system. In this example, I pulled all of the Loch Ness Monster sightings from 2000 to 2015 from the Official Loch Ness Monster Website into an Excel spreadsheet. Then, using the Text Processing extension, processed the …

What is RapidMiner

RapidMiner Overview

If you are searching for a data mining solution, be sure to look into RapidMiner. RapidMiner is open-source predictive analytics software that provides great out-of-the-box support for getting started with data mining in your organization. A free desktop version is available to get you started. The basic …

Azure Data Catalog

The Azure Data Catalog is now available in public preview. The Data Catalog provides an enterprise data repository that enables self-service data discovery for end users. It assists IT and business users by providing a collaborative way to publish documented data sets. Users can access the data via an Excel …

HASSUG BI Developer Presentation on 7/14/2015

I presented on how to become a BI developer at the Houston Area SQL Server User Group (HASSUG). This session resulted from feedback we received in the user group about the differences between database developers and BI developers. If you are interested in seeing the presentation content, it is uploaded here: Becoming a bi developer from …