Microsoft

How to set-up your Dust Microsoft connection

Overview

You can connect Dust to your Microsoft suite to enable access to the documents, spreadsheets, and presentations hosted in your Sharepoint drives. Only the content within the scope of the admin's Drive permissions will be available in Dust. The admin can granularly select the exact data they want to make available to Dust.

Dust doesn't take into account files with more than ~800Kb of extracted text. Supported files include text files (docx, pptx, .txt...), but also PDFs. You can activate PDF indexation at Spaces > Connected data and click on Manage Microsoft). xlsx files are parsed and each worksheet is available for table queries.

Option 1 : Setting up the Connection with a Service Account

To set up the Microsoft connection, follow these steps:

  1. Navigate to Spaces > Connections > Microsoft

  2. Acknowledge dust access to your Microsoft data

  3. Login to your microsoft account and authorise permission. Only organisational accounts are supported, not personal ones.

    NB : If you're not an admin on your Microsoft workspace, will have to ask your administrator to consent Dust permissions for you. Once you clicked on "Request Admin consent", you will have to wait for the administrator to validate your request in their Azure admin panel (nb : this might take a few minutes before showing up on the Azure admin panel)Once authorised, perform step 3 again. This time around, the connection will finalize.


  4. You can now select the data you want to synchronise with Dust :



Entra ID details

📘

Alternative setup outside of Dust

Entra admins can also decide to create the application in Entra themselves, and will need the following consent URL :

https://login.microsoftonline.com/{{your-entra-tenant-id}}/adminconsent
  ?client_id=04c0e2b8-9852-4fad-9a5c-b97b29e95492
  &scope=https://graph.microsoft.com/Files.Read.All 
         https://graph.microsoft.com/Sites.Read.All 
         https://graph.microsoft.com/User.Read 
         offline_access
  &redirect_uri=https://{{eu. when in Europe}}dust.tt/oauth/microsoft/finalize

When setting up a connection, Dust will create an App in Entra ID called Dust connector. It will have the following permissions, that the Entra Admin will grant either directly by setting up the connection from Dust, or by accepting the Admin consent request


Here is the list of permissions requested

API nameClaim valuePermissionTypeGranted throughGranted by
Microsoft GraphFiles.Read.AllRead all files that user can a...DelegatedAdmin consentAn administrator
Microsoft GraphSites.Read.AllRead items in all site collecti...DelegatedAdmin consentAn administrator
Microsoft GraphUser.ReadSign in and read user profileDelegatedAdmin consentAn administrator
Microsoft Graphoffline_accessMaintain access to data you...DelegatedAdmin consentAn administrator


Managing permissions

  • You should use a dedicated microsoft account (eg. [email protected]) to manage permissions for Dust. This will dictate which sites are visible to the connection in Dust.
    • All Public Sharepoint sites and groups will be seen
    • All Private Sharepoint sites where the dedicated microsoft account is "Member" or more will be seen in the Microsoft connection
    • All Teams channels that the microsoft account is part of (Standard, Shared and Private) will be seen in the Microsoft connection
    • The dedicated microsoft account needs read permission on the files. All files and documents visible by the user will be available for Dust.
📘

Group memberships in Sharepoint

In Sharepoint, access in not always granted by direct addition of the Microsoft account to the member list of the Sharepoint site/group, and is often inherited from group memberships. Make sure that you review the groups that [email protected] belongs to, in order to tightly control what the Dust connection will be able to see




Option 2 : setting the connection with a Service Principal


You can also use a Service Principal to login : fill in your tenant ID, client ID, the secret to use, and optionally the list of sites your service principal has access to, if you're using the Sites.Selected permission.


Step 1 : Create an App Registration in Entra with the following Application permissions on the Microsoft Graph API


API nameClaim valuePermissionTypeGranted throughGranted by
Microsoft GraphFiles.Read.AllRead all files that user can a...ApplicationAdmin consentAn administrator
Microsoft GraphSites.Read.AllRead items in all site collecti...ApplicationAdmin consentAn administrator
Microsoft GraphUser.ReadSign in and read user profileApplicationAdmin consentAn administrator
Microsoft Graphoffline_accessMaintain access to data you...ApplicationAdmin consentAn administrator

Step 2 : In Security & Credentials, create a secret


Step 3 : Fill in the details in Dust




Tenant ID : Directory (tenant) ID from Step 1 Client ID : Application (client) ID from Step 1 Service Account secret : Secret Value from Step 2

Selected SharePoint site(Optional) : Site IDs from Step 6, one per line

NB : In Sharepoint admin, you will need to Select the Sites that your app has access to. Check out the Microsoft documentation


Refresh Rate

A synchronization happens every 5 minutes. All files created, modified or deleted during this period will not be updated immediately, but changes will be reflected only once a new synchronization is executed.

If a lot of files have been added or modified, the synchronization will take longer time.

Labels

Dust will sync the custom columns set on files when used in Sharepoint lists. They will be expose as labels and included in the document above the content itself.

Tags allow for keyword search and filtering while prepending them to the document content will expose them to the semantic search tool.

📘

Custom columns support was added Feb 14 2025

You may need update a document to have it re-synced.