data:image/s3,"s3://crabby-images/3cdc0/3cdc0d454ace1ef7c04d4a6cf336abc4f1512c8c" alt="SQL Server 2017 Machine Learning Services with R"
The Microsoft Machine Learning R Services architecture
The architecture of R Server covers many components needed in order for the communication between R IDE or SQL Server and R engine to work properly.
Several components are involved in order to properly execute Transact SQL, R script, and return all the results back to T-SQL:
data:image/s3,"s3://crabby-images/185f6/185f6de1c3b3d045667093e8218f3ef78077e70a" alt=""
Launchpad is a new service in SQL Server 2016 that supports execution of external scripts using the external stored procedure from SQL Server. However, in SQL Server 2017, the Python launcher has also been introduced, making Launchpad generally available to the second non-SQL Server language. The idea behind Launchpad is that since the infrastructure is already prepared, the SQL Server should, in the future, support other languages as well, such as JavaScript and C++, opening this service not only to predictive analytics and machine learning, but also to other standalone languages.
The Launchpad service (service name: MSSQLLaunchpad) is dependent on the SQL Server service (service name: MSSQLSERVER), which means that, when restarting or stopping a SQL Server service, the Launchpad service will be restarted or stopped also.
In addition, the R language has also brought two additional services: R Terminal service (abbreviated as RTerm) and BxlServer, which use SQL Satellite, an additional extension to SQL server that communicates between external runtime (correlating to the Launchpad service) and SQL Server; both can be found as processes.
For easier understanding of these correlations, let's start a simple PowerShell script to see all the components of Launchpad and BxlServer:
Get-Process -Name Launchpad | Select-Object * Get-Process -Name BxlServer | Select-Object *
Additionally, with a simple T-SQL code, we can include the R script:
EXEC sp_execute_external_script @language = N'R' ,@script = N'library(RevoScaleR) OutputDataSet <- data.frame(rxInstalledPackages())'
Additionally, observe the Process Explorer and Launchpad Process:
data:image/s3,"s3://crabby-images/fa12c/fa12c4f3d39a990047f6506cb69f112db1c6c908" alt=""
It is obvious how communication creates the entire circle from SQL Server to R and back.
Furthermore, the Command Line also reveals that the Launchpad process uses the RLauncher.dll library and launches sqlsatelitelaunch using sqlsatellite.dll:
CommandLine: "C:\Program Files\Microsoft SQL Server\MSSQL14.MSSQLSERVER\MSSQL\bin\launchpad.exe" -launcher RLauncher.dll -pipename sqlsatellitelaunch -timeout 600000 -logPath "C:\Program Files\Microsoft SQL Server\MSSQL14.MSSQLSERVER\MSSQL\LOG\ExtensibilityLog" -workingDir "C:\Program Files\Microsoft SQL Server\MSSQL14.MSSQLSERVER\MSSQL\ExtensibilityData" -satelliteDllPath
"C:\Program Files\Microsoft SQL Server\MSSQL14.MSSQLSERVER\MSSQL\Binn\sqlsatellite.dll"
Many more configurations will tell you where and how communication in this architecture is set up. Opening the RLauncher.config file will show you the following configuration:
data:image/s3,"s3://crabby-images/66ebc/66ebc3a0440b31764baf8c965809ecd7682198b3" alt=""
Additionally, by changing the TRACE_LEVEL parameter, it can also reveal potential logs, showing same path and communication between the systems, as the RevoScaleR log also shows information on each worker session:
library(RevoScaleR); sessionDirectory <- 'C:\\PROGRA~1\\MICROS~3\\MSSQL1~1.MSS\\MSSQL\\EXTENS~1\\MSSQLSERVER01\\0FEB1004-867F-4AB6-B9CC-E1C824596645';sessionId <- '0FEB1004-867F-4AB6-B9CC-E1C824596645';scriptFile <- file.path(sessionDirectory, paste(sessionId, '.R', sep=''));rxIgnoreCallResult <- .Call('RxSqlSessionStart', list(sessionDirectory=sessionDirectory, sessionId=sessionId, waitTime=-1));source(scriptFile)