Tag Archives: Analytics

Power BI Grouping

Grouping means combining multiple items into fewer items. It helps us consolidate and summarize things to understand them at a higher level of granularity. Let us see how to use Power BI Grouping done easily and quickly. This is useful for Ageing Analysis, Bin or Bucket Analysis, week / custom date range analysis.

Creating Power BI Grouping

Power BI provides a simple and consistent mechanism of performing grouping for numbers, text and dates. All that you need to do is Right Click on the field name and choose Create Group.

New Group menu option

Two types of grouping is possible. Bins based or List based.

List based Power BI Grouping

For text data we can only group based upon the List type. It basically allows you to select two or more items and group them into a single category. Ungrouped items are shown as Other category.  For example here, we have divided the Segments into Risky and Safe.

Grouping text - risky and safe segmentsimage

Of course, if a new segment appears in future data, it will go under Others. You will need to edit groupings to repair such issues. This type of grouping should be used only if there are few groups and the data (in this case, segments) do not change often.

If there are too many categories and they change often, it may make more sense to have a separate table containing two columns – the segment and the classification. This table can then have a relationship on the Segment column and provide a more flexible and easier to maintain method of grouping.

Group icon

The newly created Group is added as a separate field with a special icon – which looks like two overlapping boxes.

image

Bins based Power BI Grouping

For numeric values, bin based analysis is more common. For example, for payments overdue, we usually perform ageing analysis by 30,60,90, 120 days overdue. This can be done easily using Bin based Power BI Grouping

Bin size setting of  30 for ageing analysis

This gives us ageing analysis easily.

ageing analysis

Ad-hoc grouping of numbers by using the List method is also available if required, but it is not very useful if there are too many numeric values. If there are few numeric values, ad-hoc grouping will be useful. For example, if you have just 10 unique values like a score, we can use List based grouping to classify them into low, medium and high categories.

List grouping for numbers

However, if there are too many numeric values, it is better to create a separate table to categorize numbers.

Dates based Power BI Grouping

Dates based grouping is available by default in Power BI. When you add a date type of column to a visualization, it automatically becomes broken down into Year, Quarter, Month and Days.

Bins can be created by specifying the size (as we saw with numbers)

image

This is very useful if you want weekly groups. By default, Power BI does not give you weekly breakdown. But if you group by days and choose bin size of 7, you get weekly grouping.

Number of bins

By default we create bins by specifying its bin or bucket size. But we can also use another method – by specifying the number of bins. This is very simple. The actual min and max of the column is found out and the range is divided into specified number of bins.

image

Custom bins require separate tables

Any custom bins require a separate approach.

For example, we want ageing from 0 to 30 and 31 to 45 then 46 to 90… this type of custom bin analysis requires you to create a separate table and then do the math by add a calculated column.

Similarly, for custom date grouping like custom weeks, calendar vs fiscal years and so on, we need a custom date table.

Power BI Workshop

Saturday, 26 May, 9 am to 5 pm, Mumbai

Waterstones Hotel, behind The Lalit Hotel, near International Airport, Mumbai
INR 20,000 + taxes per participant

Limited seats. Register Here

Amazing Value
  1. Full, 6-hour video recording of the entire session which can be used to train more people internally
  2. Email support for 2 months
  3. Practice Exercises with sample files
  4. Online, 2-hour follow up session
  5. Work with your own data

You must bring your own laptop. Additional software needs to be installed BEFORE arriving for the event.

Power BI Workshop Content
  1. Disadvantages / Pitfalls of data analysis in traditional Excel
  2. Understanding the Data Model
  3. Good vs Bad data
  4. Good data checklist
  5. Importing and cleaning up data using Power Query
  6. Get and Transform features
  7. Web data import and live refresh
  8. Combining data from files in a folder
  9. Combining data from multiple sheets
  10. Eliminating the use of VLOOKUP
  11. Large data handling using Power Pivot
  12. Creating dashboards using Power View
  13. Power BI Desktop usage scenarios
  14. Power BI portal
  15. Choosing the right visualization
  16. 3D maps using Power Map
  17. Sharing reports
  18. English language Q&A
  19. Bookmarks and storytelling
  20. Custom visuals
Who should Attend?
  • Anyone who spends lot of time with Excel creating reports, charts and analyzing data.
  • If you use any other BI tools like Tableu, Qlik, BO, Cognos, etc.
  • If you work extensively with Pivot Tables
Who should NOT attend?

If you want to learn DAX language, specifically, you should NOT attend this workshop.
This workshop is designed for people who have no exposure to Power BI in the past. The objective is to migrate them from the painful manual processes and legacy charts to the modern process of data visualization and interactive analysis using Power BI.

Learn from the Guru

Dr Nitin Photo for Power BI Workshop

Dr. Nitin Paranjape has coached over 330,000+ professionals across 18 countries. He is recognized as the Most Valuable Professional by Microsoft, for 15 years. Dr. Nitin is a coveted speaker with a unique humorous and motivating style of teaching. Detailed profile

Dr. Nitin’s work on Analytics

50+ Data Analytics  Blog articles
Power BI video for beginners
Shadow Analytics video for CIOs
Data Analytics Webinar
Power BI for developers video

You will be able to …
  1. Create your own interactive dashboards using Power BI
  2. Understand data quality problems and clean up data efficiently
  3. Combine data from multiple sources and create an optimal data model
  4. Choose the right visualizations to represent data in a meaningful way
  5. Design interactions within the data to simplify and amplify the usefulness of data
Venue

Waterstone Hotel
Behind The Lalit, Near International Airport, Mumbai.
Valet Parking Available

Power BI Workshop is subject to cancellation due to inadequate number of participants or factors beyond our control.

In case of cancellation, full refund will be provided.

Is Power BI Free ?

This is a commonly asked question. I will try to answer it in the simplest possible manner. Of course, this is as of May 2018. Things change very fast. So please check online for the latest status. Power BI Free does exist. In two forms. One is built into Excel and one is a subscription option.

Power BI Free in Excel

Let us understand what Power BI itself is. In simple terms it is a new way  (now 8 years old) of handling and analyzing data. It was created to overcome limitations of Excel and provide modern ways of analyzing information – visually as well as statistically.

Power BI was originally created as a set of add-ins to Excel. Even today, these continue to be add-ins. Power BI is more of a brand name in the context of Excel. You will NOT see a menu called Power BI. But it consists of the following components:

  1. Power Query – which is a method of importing and cleaning up data from 70+ sources. This tool revolutionizes the way in which we managed raw data. All the manual and tedious work, weird custom macros … all gone! What’s more, it allows you to refresh data by repeating the import and clean-up steps automatically. Which means, second time onwards, the effort is zero.  Power Query comes in two forms:
    1. Manually installed Add-in – for Excel 2010 and 2013.
      Here the add-in creates a separate tab (Menu) called Power Query. It has all the query handling options.
    2. Excel 2016 onwards, this add-in is pre-installed. The separate menu is removed. It now lives in the Data tab as a group called Get and Transform.
  2. Power Pivot this has two components
    1. Data Model – a database built-in to Excel which can handle millions of rows of data and multiple tables. It can be used to import data from multiple sources. Data Model compresses data and stores it inside the same XLSX file. It works much faster than having the data in Excel sheets (on the same PC).
    2. Power Pivot Table – which works on the data model (instead of data in Excel sheets). Creating a Pivot Table from data model allows us to use multiple tables of data and create relationships between them.
  3. Power View – is an add-in available since 2013. This is used to create interactive and visual dashboards using the data model. It is extremely flexible and powerful. It provides bar / column charts, matrix (like pivot table), pie charts, scatter charts and map (geographical visualization).
    This tool requires an outdated component called Silverlight to be installed on the PC. As of now, most PCs do have this component installed. But Microsoft itself has discontinued this component and it is not supported. Due to this, Power View becomes a limitation of sorts.
    What’s more, to view and interact with the dashboards on a browser, we need the same Silverlight component. Unfortunately, this works only on Internet Explorer – which is almost phased out globally.
    Due to these limitations, Microsoft is discouraging the usage of Power View. As a manifestation of this issue, the Power View button in 2016 is hidden. We have to customize the ribbon to add the Power View button.
    Although Power View is working as of now, it may be discontinued in future. Many organizations are disabling the use of Silverlight due to security risks associated with it.

Sharing Excel Power BI reports

Sharing reports created in Excel is done in two ways:

  1. Send the file by mail – which can lead to issues because we are making multiple copies of the file. If files exceed the email size limit, this option is not available. Unfortunately, we have already found the workaround for this limitation. We just copy paste the visualizations in PowerPoint and send the reports. This is the same as what we have been doing for decades: Copy paste Pivots and charts from Excel to PowerPoint.
    Although this works, it defeats the purpose because all the interactivity and flexibility of analytics is completely lost.
  2. We can store the Excel files containing Power Pivots and Power View on OneDrive and use Internet Explorer to view and interact with the reports. As discussed earlier, it requires Silverlight and is a risky option.

The REAL Power BI Free

Although Power BI started as individual component Add-ins, Microsoft wanted to provide a more cohesive and safe solution. That is why Microsoft combined the above components into a separate tool and created Power BI Desktop.

Power BI Desktop is a stand-alone application. It does not need Excel to be installed. It is a combination of functions provided by Power Query, Power Pivot and Power View.

It can connect to 70+ data sources (as of May 2018) including Excel files, CSV, TXT and databases. Power BI Desktop can be used for free by anyone after registering on the Power BI web site.

Using this Power BI is simple. Download the Power BI Desktop tool. Create reports on the PC and publish them on the Power BI Portal (web site). Thus far it is free.

Sharing it with others requires the paid version. Detailed comparison can be seen here.

Sharing the PBIX file

Power BI Desktop is used to create reports and the Power BI Portal (website) is used to publish and share reports. Sharing requires a paid (PRO) subscription.

However, smart users will realize that it is still possible to share the Power BI Reports by either mailing the PBIX files or storing them on some cloud platform.

While this is technically possible, it suffers from the same disadvantages which Excel based sharing suffers from. If the data is large, the file size exceeds email limit. Even if you save the file on cloud and share the link, the other users have to download the file before opening and viewing it. Furthermore, they can edit everything in the file including the report visualizations, filters and interactions.

This basically means, the original report is no longer the original report – leading to multiple versions and confusion.

Feel free to post your queries here.

Power BI Free Logo

Transforming Shadow Analytics

This content is relevant only if you are a CIO (or IT decision maker). Here is the video of the session I conducted at CIO Power List event on 4th May, 2018, at Conrad, Pune. Shadow Analytics has been around ever since “shadows” – also called end users – are around. Everyone knows about. Some people tried to eliminate it. Nobody succeeded.

This 30 minute video explains how to use Shadow Analytics as an opportunity to empower rather than restrict users and improve effective utilization of data.

Demos Included

The demos included in this Shadow Analytics video are:
Flash Fill, Insights, Explain the increase and Q&A.

Insights option in Insert menu of Excel

What is Shadow Analytics?

It is all kinds of data capture, clean-up, manipulation and report generation performed by end users without IT intervention.

If you generate a report from a business system (which is built or managed by IT), it is alright. But if you copy paste data from multiple such reports into Excel and then generate a new report, it becomes “Shadow Analytics”.

As you can imagine, it is difficult to eliminate it. Irrespective of how much time and effort you have spent on creating the most flexible ad-hoc reporting systems, it is impossible to provide every possible variation that users want. Therefore, Shadow Analytics has always been there and is likely to survive in the foreseeable future.

Problems associated with Shadow Analytics

Primarily two problems. It is extremely error prone and time consuming. There are lots of related problems. The root cause is that data is handled in a casual manner without regard for its recency and in a completely undocumented manner.

This can lead to wrong decisions, delayed decisions, increased operational risk and enormous wastage of precious time.

Summary

It is impossible to handle and correct the data sources and deliver data to users in a manner which is so easy that they stop doing the manual capture and clean-up altogether.

Once clean, accurate and updated data is available as input, creating reports can be done by end users in a more informed and productive manner.

Left hug Computer Right hug

Split column into rows

A brilliant new feature is now available in Power BI – Split column into rows. To understand why we need it, you must go and read the article  – Analyzing badly captured Survey data or feedback forms. This method used Power Query concepts of Split and Unpivot. Now these have been combined into a single, intelligent command called Split columns into rows.  It sounds confusing at first. But soon you will realize that it is an amazing tool.  Learn it just 4 minutes.

Raw data looks like this

Split column into rows - raw data

And you get a report like this. No need to use formulas or do any manual work.

Split column into rows - Final Charts

You must have the May 2017 update for Power BI Desktop installed.

Continue reading Split column into rows

Power BI Dates before 1900

This is a short post. It is like an FYI mail. Excel never understood any dates before 1900. We got used to that limitation over the decades. But Power BI does understand Dates before 1900. The best part is, you do not have to take any specific action. It just works.

Here is the raw data and the Power BI output.

Dates before 1900 - raw dataDates before 1900 - in a chart

If you try this in Excel, it just will not work. Now that you know this, starting using Power BI with Dates before 1900.

Mind you, the Power BI documentation says that the earliest limit is 1900. It still works for dates before 1900. Drill down is also supported. Here is the same data at Day level.

Drill Down

This ability  may make historians and archeologists partially happy. There time scales are huge and Power BI does not support that much of a range. But still, it is an improvement worth knowing about.

Consolidate Cross-Tab Data in minutes

Cross-tab data is where you have row headings as well as column headings. Consolidating (combining) multiple blocks of such data is a laborious process. Usually the rows and columns do not match across data blocks. You need to do a lot of copy-paste to bring them in the correct order before you can add a formula. Using a 20 year old unknown but powerful feature of Excel you can Consolidate Cross-Tab Data in minutes. You just select the data blocks and let Excel do the dirty work of matching row and column names.

Additional Resources : Consolidate Cross-Tab Data

I have already covered this topic as a step-by-step article.  Now I have created a video (9 min). You can download the sample file used in the video and try it yourself.

Continue reading Consolidate Cross-Tab Data in minutes

Pivot Report Sharing

Here is a fast paced, 8 minute video which will revolutionize Pivot Report Sharing. Using OneDrive or SharePoint, control the visibility and provide full interactivity. Reports render instantly even with large files and low bandwidth.

Pivot Report Sharing articles

New, Improved way of delivering Excel reports: Part 1
New, Improved way of delivering Excel reports: Part 2
New, Improved way of publishing Excel reports: Part 3