tsql - How to do this in SQL Server query instead of function?

Question

Welcome To Ask or Share your Answers For Others

tsql - How to do this in SQL Server query instead of function?

posted Jan 31, 2022 in Technique[技术] by 深蓝 (71.8m points)

tsql - How to do this in SQL Server query instead of function?

I have a table that has a string in one of its columns.

My table look like this:

RowCnt	Lvl	TargetID	Codes
1000	1	0	1,1,0,1,0,1,...,1,0,0,0,0
1000	1	1	0,0,1,0,1,0,...,0,1,1,1,1
1000	1	2	1,0,0,0,1,1,...,0,0,0,0,0
1000	1	3	0,1,1,1,0,1,...,1,1,1,1,1
1000	1	4	1,1,0,0,1,0,...,0,0,1,0,0
1000	2	0	0,0,1,1,0,1,...,0,1,0,1,1
1000	2	1	0,1,0,1,1,1,...,1,1,1,1,0
1000	2	2	0,0,0,0,0,1,...,0,0,0,0,1
1500	1	0	1,1,1,1,1,0,...,1,1,1,1,0
1500	1	1	1,0,0,0,0,1,...,0,0,0,0,1

See Question&Answers more detail:os

与恶龙缠斗过久,自身亦成为恶龙；凝视深渊过久,深渊将回以凝视…

1 Reply

深蓝 · Answer 1 · 2022-01-31T07:16:10+0000

A faster version of your function would be an inline Table Valued Function.

CREATE OR ALTER FUNCTION dbo.Similar (@x varchar(max), @y varchar(max))
RETURNS TABLE AS RETURN

SELECT COUNT(CASE WHEN xJ.value <> yJ.value THEN 1 END) * 1.0 / COUNT(*) AS Pct
FROM (
    SELECT *,
      ROW_NUMBER() OVER (ORDER BY (SELECT 1)) rn
    FROM STRING_SPLIT(@x, ',')
) xJ
JOIN (
    SELECT *,
      ROW_NUMBER() OVER (ORDER BY (SELECT 1)) rn
    FROM STRING_SPLIT(@y, ',')
) yJ ON yJ.rn = xJ.rn;

However, STRING_SPLIT with a row-number is not guaranteed to always return results in the actual order of the string. It may do it once, it may do it a million times, but there is always a chance the compiler could rearrange things. So instead you could use OPENJSON

CREATE OR ALTER FUNCTION dbo.Similar (@x varchar(max), @y varchar(max))
RETURNS TABLE AS RETURN

SELECT COUNT(CASE WHEN xJ.value <> yJ.value THEN 1 END) * 1.0 / COUNT(*) AS Pct
FROM OPENJSON('[' + @x + ']') xJ
JOIN OPENJSON('[' + @y + ']') yJ ON yJ.[key] = xJ.[key];

You would use it like this

WITH Y AS (
    select
      a.RowCnt,
      a.Lvl,
      a.TargetID a_TargetID,
      b.targetid b_TargetID,
      a.codes a_codes,
      b.codes b_codes,
      sim.Pct sim
    from TargetsComp A
    inner join TargetsComp B
        on a.RowCnt = b.RowCnt 
       and a.TargetID < b.TargetID
    CROSS APPLY dbo.sim(a.codes, b.codes) sim
)
insert into TargetFilled
  (RowCnt, Lvl, a_TargetID, b_TargetID, a_codes, b_codes, sim)
SELECT RowCnt, Lvl, a_TargetID, b_TargetID, a_codes, b_codes, sim
FROM Y;
-- you may want to add
-- WHERE sim.Pct < 100

I have removed the ORDER BY from the insert as I don't think it's necessary.

You should index your table as follows

CLUSTERED INDEX ON TargetsComp (RowCnt, TargetID)

Categories

tsql - How to do this in SQL Server query instead of function?

tsql - How to do this in SQL Server query instead of function?

Please log in or register to add a comment.

Please log in or register to reply this article.

1 Reply

Please log in or register to add a comment.

Just Browsing Browsing

Most popular tags