Identity Stores and Directories

March 22, 2026

Directories and identity providers are not the same as workforce source systems or local application account stores, and confusing them creates brittle synchronization and weak ownership.

On this page

Identity stores and directories are the systems that hold, synchronize, and expose identity data for lookup, authentication, and policy use. They are central to IAM, but they are not the whole identity landscape. A workforce system, a partner directory, an identity provider, and an application’s local account store may all participate in one access model. Confusing those layers creates brittle synchronization, duplicate identities, and false assumptions about what system is truly authoritative.

This topic matters because many IAM problems begin with the sentence “the directory says so.” The directory may indeed hold a value, but that does not automatically make it the original source of truth. If an employment status comes from HR, an application owner comes from an internal service catalog, and a customer membership comes from the product itself, then the directory is aggregating facts from multiple systems. The architecture must respect that distinction.

Why It Matters

When organizations collapse all identity layers into one mental model, they usually create one of two failures:

they overload one directory or identity provider with facts it does not truly own
they allow local application account stores to drift away from central lifecycle and policy controls

The result is inconsistent access. A user leaves the company but still exists in a SaaS tool because local accounts were never tied back to workforce lifecycle. A contractor’s sponsorship changes in the vendor system but the directory still shows active group membership. A customer record becomes suspended in the product but its external admin identity remains live in a separate tenant store. These are not edge cases. They are common architecture failures.

The diagram below shows the basic relationship between source systems, directories, identity providers, and local stores.

    flowchart LR
	    A["Source systems (HR, vendor, product, CMDB)"] --> B["Directory or identity graph"]
	    B --> C["Identity provider"]
	    B --> D["Governance and policy inputs"]
	    C --> E["Applications and SaaS"]
	    F["Local app account store"] --> E
	    A --> F

What to notice:

source systems originate some facts, while the directory aggregates and distributes them
the identity provider handles authentication and federation but is not necessarily the source of every attribute
local application stores may still exist, so they need boundary and synchronization discipline

The Main Identity Store Layers

Source-of-Truth Systems

These systems originate facts about an identity. For workforce identities, that may be HR or contractor onboarding. For customer identities, it may be the product’s tenant membership model. For machine identities, it may be a platform catalog or infrastructure inventory. Source systems answer questions like:

does this identity exist
what class of identity is it
who owns it
is it active, suspended, or expired

Directories and Identity Graphs

Directories organize and expose identity objects, groups, and attributes in a way that other systems can consume. They are useful because they centralize lookup and often support authentication flows or policy queries. But directories may be downstream of more authoritative sources for many attributes.

Identity Providers

Identity providers handle authentication and federation. They issue sessions, assertions, or tokens to applications and platforms. They often rely on a directory or graph, but their role is not identical to the role of a source system. An identity provider can confirm an identity successfully while still relying on upstream systems for lifecycle meaning.

Application-Local Account Stores

Many applications still keep local users, local roles, or tenant-specific identities. That is not automatically wrong. Local identity can be legitimate for product-specific authorization or customer-admin workflows. The risk appears when local stores become disconnected from central lifecycle and review, or when teams assume SSO alone solved everything.

Example: Modeling Identity Sources and Downstream Consumers

The YAML below shows a simple architecture map that distinguishes origin, directory, and consuming systems.

 1identity_data_model:
 2  workforce_status:
 3    source: hr-system
 4    replicated_to:
 5      - workforce-directory
 6      - identity-provider
 7  contractor_sponsor:
 8    source: vendor-onboarding
 9    replicated_to:
10      - workforce-directory
11  customer_tenant_membership:
12    source: product-tenant-store
13    replicated_to:
14      - customer-identity-provider
15  privileged_admin_eligibility:
16    source: access-governance-system
17    replicated_to:
18      - identity-provider
19      - pam-platform

Code Walkthrough

The value of the example is not the syntax. It is the distinction:

the source is named separately from where the value is copied
not every attribute originates in the directory
downstream systems consume data differently depending on their role

This makes architecture reviews more honest. If a team asks, “Why did this account stay active?” the answer can be traced to the originating system and the synchronization path instead of being blamed generically on “identity.”

Synchronization and Drift Risks

Identity stores create risk when synchronization logic is unclear or slow. Common problems include:

duplicate identities for the same person across multiple stores
attributes replicated without knowing which copy is authoritative
suspended or offboarded users lingering in local SaaS stores
group membership updated centrally while local app permissions remain unchanged

These issues are especially visible in hybrid environments and in older applications with local account models. The right question is not whether centralization is perfect. The right question is whether the architecture clearly defines authority, propagation, and cleanup for each important identity fact.

Common Design Mistakes

Treating the directory as the authoritative source for every identity attribute, even when it only mirrors another system.
Assuming SSO eliminates the need for local account governance in SaaS or product applications.
Allowing applications to invent local identities with no mapping back to a central lifecycle or tenant system.
Copying attributes into multiple stores without documenting which source wins during conflicts.

Design Review Question

An enterprise says its central identity provider is the source of truth for workforce, partner, and customer identity because every major application uses it for sign-in. However, HR status, contractor end dates, and customer tenant membership all originate elsewhere and do not always synchronize cleanly. Is the identity provider truly the source of truth?

Not by itself. It is a central access component, but the authoritative sources are still distributed. The stronger design names those sources explicitly, documents replication paths, and treats the identity provider as a consumer and issuer of trust, not as the origin of every identity fact.

Appears on These Certification Paths

SC-900 • cloud IAM fundamentals • enterprise architecture and identity governance learning paths

Continue Learning

This lesson prepares the ground for federation, lifecycle, governance, and product authorization. Clean source-of-truth thinking is what keeps later automation from becoming brittle.

Quiz Time

Loading quiz…

Revised on Thursday, April 23, 2026

3.2 Non-Human Identities

3.4 Identity Attributes and Metadata