Solved

How can I conditionally fill down/flash fill NULL values with previous values based on certain criteria?

2 years ago
April 11, 2023
5 replies
1594 views

KCMT
Contributor
33 replies

I would like to be able to flash fill down NULL values in my DSA table with certain conditions.

In the table below I have multiple NULL values. Take for example the column ‘CardCode DUAL’.
Row 2 with Company key MTW and project 1201121979 shows for CardCode DUAL DB0006. I would like to show value DB0006 also for all other rows where company key = MTW and project = 1201121969.

Same for Route Bron column. I would like to fill down NULL values on the most recent NON Blank value for that Company_Key+Project combination.

I think it should be possible with a self join or self select, but not sure how.

Best answer by fwagner

Hi @KCMT,

thanks for your question!

There’s several ways you can achieve the fill down (or “fill forward” or “last non empty”) - four of them are described here:

https://www.andrewvillazon.com/forward-fill-values-t-sql

The most transparent and concise way to achieve this is the LAST_VALUE/FIRST_VALUE functions of T-SQL: https://learn.microsoft.com/en-us/sql/t-sql/functions/last-value-transact-sql?view=sql-server-ver16

SELECT 
    FIRST_VALUE( [CardCode DUAL] ) OVER (
        PARTITION BY [company key], [project]
        ORDER BY some_timestamp_or_id
        ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW
    ) AS CardCode_DUAL_FILLED_FORWARD,

    FIRST_VALUE( [Route Bron] ) OVER (
        PARTITION BY [company key], [project]
        ORDER BY some_timestamp_or_id
        ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW
    ) AS Route_Bron_FILLED_FORWARD

FROM [actual_table]

Within TimeXtender you would create a custom view that has the filled-forward columns, and then update the actual table from that view.

Please let us know if that info helps you taking the next steps.

View original

Did this topic help you find an answer to your question?

fwagner
Employee
33 replies
Answer
2 years ago
April 11, 2023

Hi @KCMT,

thanks for your question!

There’s several ways you can achieve the fill down (or “fill forward” or “last non empty”) - four of them are described here:

https://www.andrewvillazon.com/forward-fill-values-t-sql

SELECT 
    FIRST_VALUE( [CardCode DUAL] ) OVER (
        PARTITION BY [company key], [project]
        ORDER BY some_timestamp_or_id
        ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW
    ) AS CardCode_DUAL_FILLED_FORWARD,

    FIRST_VALUE( [Route Bron] ) OVER (
        PARTITION BY [company key], [project]
        ORDER BY some_timestamp_or_id
        ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW
    ) AS Route_Bron_FILLED_FORWARD

FROM [actual_table]

Within TimeXtender you would create a custom view that has the filled-forward columns, and then update the actual table from that view.

Please let us know if that info helps you taking the next steps.

rory.smith
TimeXtender Xpert
707 replies
2 years ago
April 12, 2023

Hi @KCMT and @fwagner ,

the general approach suggested by Frank will work - just be aware of a pitfall: https://sqlperformance.com/2019/08/sql-performance/t-sql-bugs-pitfalls-and-best-practices-window-functions (see under Implicit frame with FIRST_VALUE and LAST_VALUE) to get the correct LAST_VALUE.

You could also load the PK fields + fields to grab the “best” value from as a separate table and use an Aggregate table to find the value you want. Then use the Aggregate table as a lookup source.

Which approach you choose depends on the performance vs. clarity balance you wish to achieve.

fwagner
Employee
33 replies
2 years ago
April 12, 2023

thanks @rory.smith you’re absolutely correct about the balance between clarity and performance :-)

I adjusted the above SQL statement to be more aligned with best practices for anyone who’s looking for a reference

rory.smith
TimeXtender Xpert
707 replies
2 years ago
April 12, 2023

Just a little note: if your data has the filled value you want to propagate in a random records in the subset, you may want to use FIRST_VALUE with IGNORE NULLS. This is new for SQL Server 2022 and Azure SQL DB only.

So in a contrived AW2014 SalesOrderDetail example:

Christian Hauggaard
Community Manager
1162 replies
2 years ago
April 28, 2023

Hi @KCMT did the answers above resolve the issue? If so can you please help us by marking the best answer above? Please let us know if you have any follow up questions

Reply

Rich Text Editor, editor1

Cookie policy

We use cookies to enhance and personalize your experience. If you accept you agree to our full cookie policy. Learn more about our cookies.

Cookie settings

We use 3 different kinds of cookies. You can choose which cookies you want to accept. We need basic cookies to make this site work, therefore these are the minimum you can select. Learn more about our cookies.

Basic
Functional

Normal
Functional + analytics

Complete
Functional + analytics + social media + embedded videos + marketing

Reply

Related topics

Contact info, e-mail or phone are not set in other languages in the chat embebed optionicon

❓ Create an interactive FAQ using VideoAsk

"Introducing VideoAsk" Webinar Q&A 🙋

🎙️Lunch & listen: brand new podcast "Angles & Insights" by ActiveCampaign

Keep messages in duplicated Typeforms

Most helpful members this week

Sign up

Login with SSO

Login to the community

Login with SSO

Scanning file for viruses.

This file cannot be downloaded

Cookie policy

Cookie settings