Skip to main content

Solving Data Type Conflicts in Power BI and SQL

Whether we like it or not, error messages are part of our lives. The other day, I faced one of those little horrid yellow messages. For me, it's impossible to stay chill when I see that message. For a second, I feel alarmed and panicked. Let me show you what I'm talking about.


The first column is the customer code, quite self-explanatory. The Registration Date column shows the date when the customer registered to my shop. The First Sale Date column shows the date when the customer bought something for the first time. And this is what I'm trying to see in the Sales Day column: If Registration Date and First Sale Date are the same, meaning customer bought something at the same day customer registered to my shop, print  "First Day", otherwise show the data of First Sale Date column. But computer says no! It says "Expressions that yield variant data-type cannot be used to define calculated columns." If I need to translate it in simplified English, it means: There can't be data with different data-types in one column. I get the error because "First Day"'s data type is string, whereas the data type of First Sale Date column is date. If the data type of my Sales Day column is going to be text, then it can't show date values, if it's going to be date, then it can't show text value. In this situation, I have no other choice but to choose a data type that can show both values. How am I going to do it then?


This is how it's solved. The data type that can show both date and text is text. What I've done in the formula is: 1. Convert the data types of Registration Date and First Sale Date columns from date to text 2. If Registration Date and First Sale Date are the same, print  "First Day", otherwise show the data of First Sale Date column. It was a successful operation and this time my formula worked.

My next question is how can we do it in SQL? Actually, we have three different ways to do it in SQL by using FORMAT, CAST and CONVERT functions. Below is the code which is not going to work.
UPDATE
       CUSTOMERS
       SET SalesDay =
       (CASE WHEN RegistrationDate = FirstSaleDay
       THEN 'First Day'
       ELSE FirstSaleDay
       END)

And this is SQL'S way of saying that tere can't be data with different data-types in one column.




Both error messages look different from each other but they are complaining about exactly the same issue. 

To enter data in SalesDayFORMAT, SalesDayCONVERT, and SalesDayCAST columns, I've used these formulas:

UPDATE
       CUSTOMERS
       SET SalesDayFORMAT =
       (CASE WHEN FORMAT(RegistrationDate, 'yyyy-MM-dd')  = FORMAT(FirstSaleDay, 'yyyy-MM-dd') 
       THEN 'First Day'
       ELSE FORMAT(FirstSaleDay, 'yyyy-MM-dd')
       END
UPDATE
       CUSTOMERS
       SET SalesDayCONVERT =
       (CASE WHEN CONVERT(VARCHAR,RegistrationDate, 121)  = CONVERT(VARCHAR,FirstSaleDay, 121) 
       THEN 'First Day'
       ELSE CONVERT(VARCHAR, FirstSaleDay, 121)
       END)
UPDATE
       CUSTOMERS
       SET SalesDayCAST =
       (CASE WHEN CAST(RegistrationDate AS VARCHAR)  = CAST(FirstSaleDay AS VARCHAR) 
       THEN 'First Day'
       ELSE CAST(FirstSaleDay AS VARCHAR)
       END)

All of them give the same result but which one should we prefer? Well, Microsoft says: 'Use the FORMAT function for locale-aware formatting of date/time and number values as strings. For general data type conversions, use CAST or CONVERT.' Format function offers more flexibility than others. While CONVERT and FORMAT accept an optional style parameter, CAST doesn't give that option. What makes CAST preferable is it's easier to read.

That's it. These are our options to solve the problem. I don't know you, but I would stick with Microsoft's advice and use FORMAT if I had faced the issue in SQL. What would you do? If you have any other idea or feel like sharing what do you think, please comment below.

Comments

Popular posts from this blog

Manipulating Grand Totals with ISINSCOPE in Power BI

ISINSCOPE function was introduced in with November 2018 Power BI update. We can say it's a very young player in the game, but it's definitely magic. It can be used to manage lots of issues related to hierarchies, but in this post, I'm going to focus on manipulating the grand total line with it.  This is my budget forecast table. The budget forecast that you see for every month is not the forecast for every month, but it's the forecast for the whole year. The budget forecast for 2020 has changed every month. That's why we have different numbers for every row. Below you see the table visual with the year-month column and a measure that shows the values for every month. The thing that I want you to pay attention to is the total line. I wanted to see the last month's value for the total line and this is the measure for that.  This measure calculates this: If the year-month column has one value then take the average of that value. If it doesn't have one value, li...

How to Add Flags to Power BI Reports

Adding flags to Power BI Reports might be easier said than done. If you have a dataset which contains data of various countries, adding country flags looks cool. The process to do it, on the other hand, might be compelling. Now I am going to share with you the obstacles I  have faced during the process and of course, how I overcame them. First I made an excel list of countries that my dataset has. Then I have downloaded to my computer all the flags I need from Wikipedia and converted them to Base64 link via an image encoder website  base64-image.de  and added those links to my excel file, which looks like this: After I added my excel file to Power BI, I converted the data category of URL column from 'uncategorized' to 'image URL', which might be a small step for you but it's a big step for the success of the flag operation. Then I created one to many relationship with DimLocation table and chose 'both' as filter direction, so that I can filter from both t...