添加链接
link管理
链接快照平台
  • 输入网页链接,自动生成快照
  • 标签化管理网页链接

Hello,

I'm trying to 'extract' the sql tables/views used in some sql statements.

In Python this is quite easy with sqlparser. So I thought using this in PowerBI/Query.

This is my code (Temporary only 1 row)

let
    Source = Sql.Database("syn-eu2-prd-edw-001.database.windows.net", "syndpeu2prdedw1"),
    SynapsMonitor_SqlStmt = Source{[Schema="rdv_60_169",Item="SynapsMonitor_SqlStmt_V_Just_Stmt"]}[Data],
    #"Replaced Value" = Table.ReplaceValue(SynapsMonitor_SqlStmt,"#(cr)#(lf)"," ",Replacer.ReplaceText,{"statement"}),
    #"Kept First Rows" = Table.FirstN(#"Replaced Value",1),
    #"Run Python script" = Python.Execute("# 'dataset' holds the input data for this script#(lf)#(lf)import sql_metadata as sm#(lf)dataset[""Test""] = '/'.join(sm.Parser('""' + str(dataset[""statement""]) + '""').tables)",[dataset=#"Kept First Rows"]),
    #"Expanded Value" = Table.ExpandTableColumn(#"Run Python script", "Value", {"statement", "SqlIdx", "Test"}, {"Value.statement", "Value.SqlIdx", "Value.Test"})
    #"Expanded Value"

I've tried al posible combinations, this one is ending into a error

DataSource.Error: ADO.NET: Python script error.
<pi>Not supported query type: "0    select r.[ROUTINE_SCHEMA], r.[ROUTINE_NAME], r...
Name: statement, dtype: object"
ValueError: Not supported query type!
Details:
    DataSourceKind=Python
    DataSourcePath=Python
    Message=Python script error.
<pi>Not supported query type: "0    select r.[ROUTINE_SCHEMA], r.[ROUTINE_NAME], r...

Might be caused due to the extra "

But when I just enter field fieldname I'm getting the error

DataSource.Error: ADO.NET: Python script error.
<pi>ValueError: The truth value of a Series is ambiguous. Use a.empty, a.bool(), a.item(), a.any() or a.all().
Details:
    DataSourceKind=Python
    DataSourcePath=Python
    Message=Python script error.
<pi>ValueError: The truth value of a Series is ambiguous. Use a.empty, a.bool(), a.item(), a.any() or a.all().
    ErrorCode=-2147467259

This (I think) comes from the pandas problem with AND / OR that in some aricles is changed into & |

But if I change that the SQL statement goes wrong.

Funny thing is that when I use this code all works fine....

let
    Source = Sql.Database("syn-eu2-prd-edw-001.database.windows.net", "syndpeu2prdedw1"),
    SynapsMonitor_SqlStmt = Source{[Schema="rdv_60_169",Item="SynapsMonitor_SqlStmt_V_Just_Stmt"]}[Data],
    #"Replaced Value" = Table.ReplaceValue(SynapsMonitor_SqlStmt,"#(cr)#(lf)"," ",Replacer.ReplaceText,{"statement"}),
    #"Kept First Rows" = Table.FirstN(#"Replaced Value",1),
    #"Run Python script" = Python.Execute("# 'dataset' holds the input data for this script#(lf)#(lf)import sql_metadata as sm#(lf)dataset[""Test""] = '/'.join(sm.Parser(""select r.[ROUTINE_SCHEMA], r.[ROUTINE_NAME], r.[ROUTINE_TYPE], p.create_date [CREATED_DATE], p.modify_date [MODIFIED_DATE], cast(e.value as nvarchar(max)) [DESCRIPTION] from [INFORMATION_SCHEMA].[ROUTINES] r join sys.schemas s on s.name = r.[ROUTINE_SCHEMA] join sys.objects p on p.name = r.[ROUTINE_NAME] and p.schema_id = s.schema_id and p.parent_object_id = 0 left outer join sys.extended_properties e on p.object_id = e.major_id and e.minor_id = 0 and e.class = 1 and e.name = 'MS_Description'"").tables)",[dataset=#"Kept First Rows"]),
    #"Expanded Value" = Table.ExpandTableColumn(#"Run Python script", "Value", {"statement", "SqlIdx", "Test"}, {"Value.statement", "Value.SqlIdx", "Value.Test"})
    #"Expanded Value"

But off course this is not the final wanted result...

Any suggestions ?

Pandas follows the numpy convention of raising an error when you try to convert something to a bool. This happens in a if or when using the boolean operations, and, or, or not. It is not clear what the result of.

example

5 == pd.Series([12,2,5,10])

The result you get is a Series of booleans, equal in size to the pd.Series in the right hand side of the expression. So, you get an error. The problem here is that you are comparing a pandas pd.Series with a value, so you'll have multiple True and multiple False values, as in the case above. This of course is ambiguous, since the condition is neither True or False. You need to further aggregate the result so that a single boolean value results from the operation. For that you'll have to use either any or all depending on whether you want at least one (any) or all values to satisfy the condition.

(5 == pd.Series([12,2,5,10])).all()
# False

or

(5 == pd.Series([12,2,5,10])).any()
# True

Power BI Monthly Update - October 2024

Check out the October 2024 Power BI update to learn about new features.

Microsoft Fabric & AI Learning Hackathon

Learn from experts, get hands-on experience, and win awesome prizes.

Fabric Community Update - October 2024

Find out what's new and trending in the Fabric Community.