intranet-search-cockpit
Intranet search cockpit. Corporate intranet search is not plug-and-play.
State:
Current state is "withdrawn". No coding will get done. Docs still get updates.
Background: It makes more sense to create new content in one system. Maybe other systems will get integrated later.
For the next few days I will practice this pattern: IUA Channel
Introduction
Content Repository: A content repository or content store is a database of digital content with an associated set of data management, search and access methods allowing application-independent access to the content, rather like a digital library, but with the ability to store and modify content in addition to searching and retrieving.
Intranet: An intranet is a computer network for sharing information, collaboration tools, operational systems, and other computing services only within an organization, and to the exclusion of access by outsiders to the organization. The term is used in contrast to public networks, such as the Internet.
In most corporate intranets there are several content repositories.
The drawback: users don't have a single search interface.
Internet search is easy: Just use Ecosia or Bing. But corporate intranet search is .... (please send me your favorite term via mail: guettli.intranet-search-term@thomas-guettler.de)
Two Options
- Option 1: Continuously index the content repositories in the background. Searches run against this own index.
- Option 2: Use the search APIs of the content repositories. No own index exists. The results of the N systems get displayed on one page.
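Option 2 could look roughly like the following sketch. It is only an illustration, not code from this project: the `Hit` class, the `federated_search` function and the two fake backends are made-up names, and a real backend would call the search API of one content repository.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass
from typing import Callable


@dataclass
class Hit:
    source: str  # name of the content repository (hypothetical field)
    title: str
    url: str


# A backend is any callable that takes a query and returns hits.
Backend = Callable[[str], list]


def federated_search(query: str, backends: dict) -> dict:
    """Send the query to every backend in parallel and group hits by source."""
    results = {}
    with ThreadPoolExecutor(max_workers=max(1, len(backends))) as pool:
        futures = {name: pool.submit(search, query) for name, search in backends.items()}
        for name, future in futures.items():
            try:
                results[name] = future.result(timeout=10)
            except Exception:
                # One failing repository should not break the whole result page.
                results[name] = []
    return results


def fake_wiki(query: str) -> list:
    """Stand-in for a wiki search API (assumption, no real service is called)."""
    return [Hit("wiki", f"Wiki page about {query}", "https://wiki.example.invalid/1")]


def broken_backend(query: str) -> list:
    """Stand-in for a repository that is currently unreachable."""
    raise RuntimeError("repository unreachable")


results = federated_search("scrum", {"wiki": fake_wiki, "tracker": broken_backend})
for source, hits in results.items():
    print(source, [hit.title for hit in hits])
```

Grouping by source avoids having to compare relevance scores across systems, which are usually not comparable. The price is that the user still sees N result lists, just on one page.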
Tools
Indexing + DB (low-level) (Option1)
Stackoverflow Tag-trend of above tools
Detecting which pages have changed: RSS
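A minimal sketch of the RSS idea, assuming the content repository publishes an RSS 2.0 feed whose items carry `link` and `pubDate` with a timezone offset. The function name `changed_links` and the sample feed are invented for illustration.

```python
import xml.etree.ElementTree as ET
from datetime import datetime, timezone
from email.utils import parsedate_to_datetime

# Made-up feed, shaped like an RSS 2.0 "recent changes" feed.
SAMPLE_FEED = """<rss version="2.0"><channel>
  <title>Example wiki changes</title>
  <item><title>Old page</title><link>https://wiki.example.invalid/old</link>
    <pubDate>Mon, 06 Jan 2020 10:00:00 +0000</pubDate></item>
  <item><title>New page</title><link>https://wiki.example.invalid/new</link>
    <pubDate>Wed, 08 Jan 2020 10:00:00 +0000</pubDate></item>
</channel></rss>"""


def changed_links(rss_xml: str, last_indexed: datetime) -> list:
    """Return the links of all feed items published after last_indexed."""
    root = ET.fromstring(rss_xml)
    changed = []
    for item in root.iter("item"):
        link = item.findtext("link")
        pub_date = item.findtext("pubDate")
        if not link or not pub_date:
            continue  # skip items without the fields this sketch relies on
        # pubDate uses the RFC 822 date format; the offset makes it timezone-aware.
        if parsedate_to_datetime(pub_date) > last_indexed:
            changed.append(link)
    return changed


last_run = datetime(2020, 1, 7, tzinfo=timezone.utc)
to_reindex = changed_links(SAMPLE_FEED, last_run)
print(to_reindex)
```

Only the returned pages need to be fetched and indexed again. Pages that were deleted do not show up in such a feed, so they would need a separate mechanism.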
High level
- manifoldcf looks dated.
- opensemanticsearch: community project or one-man show? (GitHub contributors, contact)
- List of open source Enterprise Search Software
- Spec: OpenSearch (for example stackoverflow.com)
List of Content Repositories
Examples:
Goals
- Provide one search interface for several content repos.
- Every search should get logged. This makes it possible to understand the employees' needs and to improve the search results in the future. (Build-Measure-Learn feedback loop)
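The logging goal could be sketched like this. It is an assumption about how it might be done, not an existing part of the project: `logged_search`, `zero_hit_queries` and the record fields are made-up names, and the log is written as one JSON object per line.

```python
import io
import json
import time
from typing import Callable, TextIO


def logged_search(query: str, user: str, search: Callable, log: TextIO) -> list:
    """Run search(query) and append one JSON line describing the search to log."""
    started = time.time()
    hits = search(query)
    record = {
        "timestamp": started,
        "user": user,
        "query": query,
        "hit_count": len(hits),
        "duration_ms": round((time.time() - started) * 1000, 1),
    }
    log.write(json.dumps(record) + "\n")
    return hits


def zero_hit_queries(log_lines) -> list:
    """Queries that returned nothing: candidates for improving the search."""
    return [r["query"] for r in map(json.loads, log_lines) if r["hit_count"] == 0]


# Demo with an in-memory log and two stand-in search functions.
log = io.StringIO()
logged_search("vacation policy", "alice", lambda q: [], log)
logged_search("scrum", "bob", lambda q: ["Scrum guide"], log)
missing = zero_hit_queries(log.getvalue().splitlines())
print(missing)
```

Queries without hits are the simplest "measure" step of the loop: they show what employees look for and don't find.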
Stretch Goals
- Extend the knowledge of a content repository. Example: In the corporate intranet, the definition of Scrum Master at Wikipedia can get extended by local annotations. These local annotations provide additional details on how to apply the general (worldwide) knowledge to the particular ??? inside the company.
Cockpit
Good search results don't fall from the sky. You need a way to point the non-intelligent indexers in the right direction.
- Configure content repositories: store credentials, GUI to add new repos.
- A way to add hints to the non-intelligent indexers. I guess this is needed because there are not enough hyperlinks between the documents to get a solid PageRank.
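One possible shape for such hints, as a sketch only: a curated list that says "for this query term, boost this document". The `Hint` class, the `apply_hints` function and the multiplicative boost are assumptions, not a design decision of this project.

```python
from dataclasses import dataclass


@dataclass
class Hint:
    term: str     # query term the hint applies to
    url: str      # document that should rank higher for that term
    boost: float  # multiplier applied to the indexer's score


def apply_hints(query: str, scored: dict, hints: list) -> list:
    """Re-rank indexer scores (url -> score) using manually curated hints."""
    terms = set(query.lower().split())
    adjusted = dict(scored)  # leave the indexer's own scores untouched
    for hint in hints:
        if hint.term.lower() in terms and hint.url in adjusted:
            adjusted[hint.url] *= hint.boost
    return sorted(adjusted, key=adjusted.get, reverse=True)


# Scores as an indexer might return them (made-up numbers).
scores = {
    "https://wiki.example.invalid/scrum-general": 1.0,
    "https://wiki.example.invalid/scrum-at-our-company": 0.8,
}
hints = [Hint("scrum", "https://wiki.example.invalid/scrum-at-our-company", 2.0)]

ranking = apply_hints("scrum master", scores, hints)
print(ranking)
```

The hints live outside the index, so they could be edited in the cockpit GUI without re-indexing anything.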
Related
ETL
ETL: In computing, extract, transform, load is the general procedure of copying data from one or more sources into a destination system which represents the data differently from the source(s) or in a different context than the source(s).
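Applied to search, ETL could mean: pull documents out of a content repository, map them onto one common schema, and store them where the indexer can read them. The sketch below uses an in-memory SQLite database as the destination. The source rows, field names and the `docs` table are made up for illustration.

```python
import sqlite3


def extract() -> list:
    """Pretend source: rows as a wiki export might deliver them (made-up shape)."""
    return [
        {"Title": " Scrum Master ", "Body": "Role description",
         "Link": "https://wiki.example.invalid/scrum"},
        {"Title": "Vacation", "Body": "How to request leave",
         "Link": "https://wiki.example.invalid/vacation"},
    ]


def transform(rows: list) -> list:
    """Map source fields onto the destination schema and normalise titles."""
    return [(r["Link"], r["Title"].strip().lower(), r["Body"]) for r in rows]


def load(conn: sqlite3.Connection, docs: list) -> None:
    """Write the documents into the destination table, replacing older versions."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS docs (url TEXT PRIMARY KEY, title TEXT, body TEXT)"
    )
    conn.executemany("INSERT OR REPLACE INTO docs VALUES (?, ?, ?)", docs)
    conn.commit()


conn = sqlite3.connect(":memory:")
load(conn, transform(extract()))
load(conn, transform(extract()))  # running twice does not duplicate rows
titles = [row[0] for row in conn.execute("SELECT title FROM docs ORDER BY title")]
print(titles)
```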
Authentication Libs
- libsaas (Python): unmaintained.
Content Repos
Confluence
Github
Stackoverflow Teams
Change Name?
"Intranet" is dead. We don't own servers, we don't run a corporate network. It is about searching corporate SaaS pages. Alternatives: multi-saas-search, SaaSRank, SaaS-Umbrella, ...
Later
Platform
Make it easy to add new SaaS sites and authentication methods. Add "platform" to the name.