コンテンツにスキップ

利用者:Clapon/カテゴリ関連文書/Category intersection

このページはカテゴリ関連文書を理解するためのClaponの個人的な作業用ページです。

//Category intersectionって、どう訳せばしっくりくるだろうか… //とりあえず Intersection =「共通集合」としておく。


Category intersection は1つ以上のカテゴリがつけられている記事を見つけやすくするための方法です。メディアウィキの仕様とカテゴリ付与の方針を大きく変える必要があります。これらの変更がカテゴリ付与の問題を解決し、Wikipedia編集者の紛争を終わらせるのではないかと期待されています。Category intersectionはカテゴリを簡単に管理するだけではなく索引としてもメリットがあります。

Category intersectionはかなり以前から望まれていました。wikitech-lメーリングリストでは(重要そうではないので訳すのはやめる)

多くのカテゴリは共通した「第一」カテゴリがあります。たとえばen:Category:American actorsen:Category:Actorsen:Category:American peopleの共通集合カテゴリです。とはいえ、これらの「第一」カテゴリは大抵サブカテゴリに分けられています。もしそれらのサブカテゴリがそのまま配置されていれば、共通集合カテゴリは自動作成できるかもしれません。 ドイツ語版ウィキペディア カテゴリ はすでに完全な第一カテゴリが作成されています。 ここでは実際に共通集合カテゴリの作成を必須することを提案します。システムを置き換えるのではなく、機能追加することを目的としています。

Category intersection is the ability to find all articles that are members of more than one category. It requires a change to the MediaWiki software as well as a major change to the policies related to how categories are populated. It is hoped that these changes will solve some long standing categorization problems and end some common conflicts between Wikipedia editors. Category intersection also offers the possibility of adding several new features that will benefit users by adding valuable research and indexing tools as well as making the category system easier to manage.

Category intersection has been a desired feature for quite some time. Looking through the wikitech-l mailing list archives, someone even wrote the code implementing a version of category intersection. This comment points out its limitation: "I don't see how this can be more than marginally useful unless it also searches all subcategories to infinite depth (with recursion checks?!)."

Many existing categories are logically the intersection of attributes for which "primary" categories exist, for example Category:American actors is logically the intersection of Category:Actors and Category:American people. Although these "primary" categories are today generally subdivided into subcategories, if they were directly (fully) populated the "intersection categories" could be automatically generated. Categories in the German Wikipedia are already organized into fully populated primary categories.

This proposal tries to envision the changes necessary to make category intersection a reality. It is designed to augment the current categorization system, not replace it.

背景 (Background)[編集]

カテゴリが最初にウィキペディアで使われたのが2004年で、当時は大きなカテゴリの検索結果数を制限する機能はありませんでした。とても大きなカテゴリはパフォーマンス上問題があり、検索結果を200件に絞るというソフトウェアの改良が行われました。200以上のエントリがあった場合、ユーザはすべてのエントリを見るために複数ページを移動しなければいけなくなりました。このページ間の移動機能は大きなカテゴリに対しては実用的でないものになりました。大きなカテゴリのパフォーマンスの問題とページ間のナビゲーションによって、大きなカテゴリはより小さいサブカテゴリに分けられるようになりました。

2005年に、カテゴリの目次テンプレート {{CategoryTOC}}が作成されました。目次によって、大きなカテゴリも数回のクリックで目的ページへたどり着くことができるようになりました。これらの機能があれば、カテゴリが小さくなければならないという理由はもはや何もありません。

当初からカテゴリ分けの案として複数のカテゴリ分類法があげられています。カテゴリ付けやサブカテゴリに分ける方法はひとつではありません。これらのサブセットカテゴリがあることで、より上位のカテゴリをすべて把握することが難しくなっています。サブセットカテゴリと上位カテゴリの両方に追加するか、サブカテゴリやそのまた下位のサブカテゴリを列挙するかのどちらかしかありません。記事が「下位レベル」と「上位レベル」の両方に入れるべきか、また今つけられているカテゴリがふさわしいものか正確に判断する方法は、編集者の間で長く議論されています。

これらの議論からウィキペディアにおけるカテゴリの目的といくつかの明確なカテゴリに分けることが重要だとわかりました。

  • カテゴリは閲覧のためのツールである: 目次の機能は特定のテーマに沿って記事を探しやすくします。このタイプのカテゴリの例としてen:Category:Film actorsがあります。
  • カテゴリは記事を分類するものである: 現在のルールではその記事にとってもっとも詳細なカテゴリを割り当てるようになっています。このような分類は時として閲覧の制限になることがあります。例えばつり橋(suspension bridges)は、en:category:Bridgesではなくen:Category:Suspension bridgesに割り当てられます。この状態では橋を記事を探すのが難しくなります。橋の形状に詳しい人やある形状に興味のある人を除いて。
  • カテゴリはテーマの索引である: カテゴリ分けの最近のルールのせいでサブカテゴリに細分化されているため、多くのトピックレベルのカテゴリは索引としては役に立ちません。例えばAmerican peopleにあるすべての記事の一覧を見る方法はありません。監督は「映画監督」よりも「アメリカの映画監督」のほうが注目されやすいように、そのように「注目度」によってカテゴライズしたほうが使いやすいかもしません。
  • カテゴリはデータベース検索である: 多くのカテゴリは本質的には二つ以上の大きなカテゴリの共通集合です。例えば、en:Category:American film directorsen:Category:Film directors and en:Category:American peopleの共通集合と考えられます。多くの共通集合カテゴリは有益だと思われる人がいるからこそ存在します。 これらのカテゴリをどんどん追加すると、記事のカテゴリリストは散らかる一方ですからこれらの行為は推奨されませんし、そのためカテゴリが削除されることも多くあります。さらにこれらのカテゴリは手で追加されるのでen:Category:Film directorsen:Category:American peopleの両方にある記事がen:Category:American film directorsに存在しなかったり、そのまた反対にen:Category:American film directorsにある記事がen:Category:Film directorsen:Category:American peopleに存在しない場合もありえます。
  • カテゴリは他のカテゴリの索引である: 単純に他のカテゴリの索引としての役割を果たしているカテゴリがあります。例えば、en:Category:Categories by nationalityen:Category:Categories by countryのほぼすべてのサブカテゴリは共通集合カテゴリの索引となります。

When categories were initially added to Wikipedia in 2004 there was no mechanism to limit the search result for large categories. Very large categories caused performance problems, and a software change was made to limit the search result to 200 entries at a time. If there are more than 200 entries, users must navigate through multiple pages in order to see all the entries. This page by page navigation mechanism becomes impractical with large categories, as it takes much too long to see the entries at the end of the alphabet. The performance considerations of large categories and page-by-page navigation precipitated policies to depopulate large categories into smaller subcategories.

In mid 2005 the category table of contents template, {{CategoryTOC}}, was created. With the table of contents it became possible to navigate through very large categories with a few clicks. Due to the combination of the performance change and CategoryTOC, there is no longer any reason that categories need to be small.

Multiple category taxonomies have been part of the categorization scheme from the beginning. It is possible to take a category and subcategorize it in many different ways. Use of these "subset" categories makes it difficult to find all members of a "higher level" category; either articles have to be added to both the "subset" and "higher level" categories or the members of the "subcategories" (and, recursively, their subcategories) have to be enumerated. Precisely defining the circumstances in which articles should be added to both "lower level" and "higher level" categories, and even whether this is ever appropriate, remains a source of continuing discussion among editors (see, for example, Wikipedia:Categorization/Categories and subcategories and Wikipedia talk:Categorization/Archive 7).

This history has led to several overlapping views about the purpose of Wikipedia's categories and to the creation of several distinct kinds of categories:

  • Categories are a tool for browsing: they function as a table of contents, leading users to the articles on a specific subject. An example category of this type is Category:Film actors.
  • Categories are a means of classifying articles: the current conventions encourage placing articles in the most specific category. Having categorization function as a classification system is often in conflict with categorization as a tool for browsing. For example, suspension bridges are added to Category:Suspension bridges, but not category:Bridges. This makes bridges hard to find by browsing unless the user already knows the type of bridge (or is only interested in certain types of bridges).
  • Categories are an index of a subject: Due to the current conventions for categorization, many topic level categories are not useable as an index because they have been broken into subcategories and depopulated. For example there is no way to see an index of all American people. It would be useful to have categories fully populated at the "level of notability". For example directors are much more likely to be notable as "film directors" than as "American film directors".
  • Categories are a database search: Many categories are in essence the intersection of two or more larger categories. For example, Category:American film directors can be thought of as the intersection of Category:Film directors and Category:American people. There are many intersection categories that do not exist that some people might find useful. Adding more and more of these categories clutters up the category listings for articles so they are discouraged and often deleted. In addition, since these categories are manually populated it is entirely likely that an article in both Category:Film directors and Category:American people does not appear in Category:American film directors or, conversely, that an article in Category:American film directors does not appear in one or both of Category:Film directors or Category:American people.
  • Categories are an index of other categories: There are many categories that function simply as an index of other categories. For example, nearly all the subcategories of Category:Categories by nationality and Category:Categories by country are index categories providing an index of a specific set of "X by Y" intersection categories.

主な提案 (The core proposal)[編集]

  • サブカテゴリに分解されている第一カテゴリをいっぱいにします。
  • その場で共通集合カテゴリを作成できる機能を追加する。すべてのユーザはカテゴリを選択し、選択結果の中から共通集合を作成できます。
  • 記事の下部には第一カテゴリのみ表示します。
  • どの記事からも共通カテゴリを選べるシンプルなインタフェースを作成します。
  • Fully populate most of the primary (topic level) categories that have been broken into subcategories.
  • Add the capability to create category intersections on the fly. All users will be able to select categories and create an intersection from the selection.
  • Show only primary (topic level) categories on the bottom of article pages.
  • Create a simple interface to select category intersections from any article page.

第一カテゴリの充実 (Fully populated primary categories)[編集]

共通集合カテゴリを最適化するには、多くのカテゴリにたくさんの記事を入れていっぱいにするべきです。カテゴリは定義が一致するすべての記事が投入されていたり、サブカテゴリのみであったりする必要があります。もしカテゴリが十分に充実されていたら、それは「第一」カテゴリと呼ぶにふさわしいでしょう。第一カテゴリはその主題となる記事、すなわちそのカテゴリの名前の由来となった記事と一致するべきです。例えばen:Film directorなどです。

ルールはこのようになっています:もし完全にほかのカテゴリの共通部分について述べられているカテゴリであれば、第一カテゴリではなく共通集合カテゴリとして定義するべきです。たとえば、en:Category:American film directorsen:Category:Film directorsen:Category:American peopleの共通集合として定義されるので、第一カテゴリの充実につながります。en:Category:American film directorsは通常のカテゴリとしては存在しませんし、どんな記事にもカテゴリとしては登場しません。それぞれの共通集合を選択すると両方のカテゴリに含まれる記事が表示されます。もし理由なく共通集合カテゴリに入っているのではなく主題に関係ある記事があるのなら、共通集合カテゴリの説明文に「~を参照」とリンクされるでしょう。たとえば、共通集合カテゴリのen:Category:American film directorsに対応するen:American film directorsをコメントに追記するようなことです。

共通集合であるすべてのカテゴリの中身をより大きな第一カテゴリに移動します。第一カテゴリはやや大きすぎる場合もあるでしょう。(en:Category:American peopleのように。)このようにカテゴリをいっぱいにすることにより、その主題に関連したウィキペディアのすべての記事の完全な索引を作ることができます。

この提案では、記事に表示されるカテゴリリストも変更されます。いっぱいになった第一カテゴリだけが表示されます。たとえばen:Laurence Fishbourne という記事は現在、以下のカテゴリを含んでいます。

Categories: 1961 births | African-American actors | American film actors | American television actors | Best Actor Academy Award nominees | Living people | M*A*S*H actors | Miami Vice actors | A Nightmare on Elm Street actors | People from Augusta, Georgia

提案内容を適用するとこのようになります。:

Categories: 1961 births | American people | People of African descent | Actors | Film actors | Television actors | Best Actor Academy Award nominees | Living people | M*A*S*H actors | Miami Vice actors | A Nightmare on Elm Street actors | People from Georgia (US State) | People from Augusta, Georgia

少し補足しましょう。このカテゴリの定義に混乱するかもしれません。"People from"カテゴリは現在、住民とはかぎらずにその土地と顕著な関係がある人物を定義したカテゴリです。これはより小さな部分や大きな部分のどちらもが第一カテゴリとなりうるということです。共通集合カテゴリを導入するために、国家以下のすべての地理項目を充実させておくとよいでしょう。同様に、「映画俳優」を「映画」と「俳優」の共通集合として定義することはできません。有名な舞台俳優であった人が後に映画俳優になるかもしれないからです。記事はこれら2つのカテゴリに振り分けられますが、共通集合カテゴリ自体に振り分けられるわけではありません。ですので「映画俳優」は共通集合カテゴリの候補にはなり得ません。これらの理由から「俳優」とそこから派生するカテゴリすべてが第一カテゴリとなると考えられます。

For category intersection to work best, many categories must be fully populated. Categories will need to populated with ALL articles that meet the definition of the category or to have NONE of the articles because they can be found in subcategories. If a category is fully populated, it would be called a "Primary" category. Primary categories should correspond to topic articles. That means that there is or could be an eponymous article for the category. An example of this is Film director.

The general rule would be: If a category can be completely and totally expressed as the intersection of other categories, it is not a primary category and should be defined only as this intersection. For example, Category:American film directors can be defined as the intersection of Category:Film directors and Category:American people, which would in turn be fully populated primary categories. Category:American film directors would not exist as a "regular" category, and would never appear as a category in any article. Articles in both categories would be displayed by selecting to view their intersection. If there are articles that relate to an intersection topic, but for some reason are not in one or more of the intersected categories, they can appear as normal wikilinks in a "See also" section in the intersection category's text description. For example there might be a comment to see an article called American film directors in the intersection corresponding to Category:American film directors.

All existing categories that are intersections would be depopulated and their members moved to the larger primary categories. Some primary categories will be rather large (like Category:American people). Since they are fully populated, each primary category will be a complete index of all the articles in Wikipedia that relate to the topic.

This proposal will change the list of categories that appears on articles. Only the primary (fully populated) categories will appear. For example, the Laurence Fishbourne article currently contains the following categories:

Categories: 1961 births | African-American actors | American film actors | American television actors | Best Actor Academy Award nominees | Living people | M*A*S*H actors | Miami Vice actors | A Nightmare on Elm Street actors | People from Augusta, Georgia

Under this proposal it would contain:

Categories: 1961 births | American people | People of African descent | Actors | Film actors | Television actors | Best Actor Academy Award nominees | Living people | M*A*S*H actors | Miami Vice actors | A Nightmare on Elm Street actors | People from Georgia (US State) | People from Augusta, Georgia

There are a few things to note about this. The definition of some of these categories might be confusing. The "People from" categories are currently defined as people who have a notable connection with the place, but might not be citizens of the larger country. This means that both the smaller and larger subdivisions are primary categories. For the sake of facilitating intersection categories it would probably be useful to fully populate all geographical subdivisions from the level of nationality on down. Likewise, it is not possible to define film actors as the intersection of "film" (or film people) and "actors", because (for example) a person could be a famous stage actor who later became a film director. Articles might belong in these two categories but NOT belong in the "intersection" category, which means "film actors" is not a candidate for an intersection category. For this reason it might be decided to make "actor" and all the "actor by medium" categories primary categories.

共通集合カテゴリのための名前空間 (New namespace for category intersections)[編集]

共通集合カテゴリのために新しい名前空間を作成します。ページはこの名前空間の中におかれ、"Index" や "Intersection" といったカテゴリとよく似た記事の一覧とわかる空間名になるでしょう。この提案の中ではどちらの名称もすでに使われていますが、提案が実現したときには別の名前になるかもしれません。共通集合のページはその場で作成され、探すのに容易なシンプルな名前になります。たとえば、Intersection:Actor::American people::People of African descent(日本語訳:Intersection::俳優::アメリカ合衆国の人物::アフリカ出身の人物)というように。同様に、これは共通集合ページへのリンク記述用の名前として使えます。(補足:正確な記述法とURLは違います。)共通集合カテゴリへのリンクは以下のように作成します。:

[[Intersection:Actor::American people::People of African descent]]

ほかのリンクと同様にページへのリンクはリンク先のそのままの名称を表示しなくてすむ「パイプ」を使うこともできます。このリンクは共通集合名前空間に該当記事がなくても青字(ページが存在する場合)として表示されますが、指定されたカテゴリがひとつでもなければ赤リンクとなります。これはすなわち、ユーザが前もって共通集合ページを作っていようが作っていまいが、存在するすべてのカテゴリに対して共通集合が作成できるということです。

共通集合ページはカテゴリページと似通ったものになるでしょう。ページ名があり、執筆者によって追加された概要があり(「編集」で追記できます)、その共通集合カテゴリに含まれるすべての記事やサブカテゴリへのリンクのうち最初の200件があるページとなります。(通常のカテゴリ表示とそっくりでしょう。)おそらく、続いて共通集合の表示件数制限を拡大する仕組みができるでしょう。

ページ名はアクセスする記事がもともと持っているURLに応じてソートされ、共通集合カテゴリに表示されます。カテゴリAとBの共通集合と、カテゴリBとAの共通集合は同じものですから、共通集合カテゴリには同じ内容のページがいくつかあります。詳しくは後述します。

There will be a new namespace for the creation of category intersections. Pages in this namespace, perhaps called "Index" or "Intersection" would look very similar to a Category listing of articles. In this proposal both names are used, but any other name could be selected when this proposal is implemented. Intersection pages can be created on the fly, simply by typing the name of the intersection you are looking for. For example, you could go to the page Intersection:Actor::American people::People of African descent. Likewise, this would be the mark-up for creating a link to an intersection page. (Note: The precise mark-up and URL might look different from this.) So you could add a link to an intersection page by adding:

[[Intersection:Actor::American people::People of African descent]]

to a page. Like any other link, these links could be "piped" so the text displayed to the user would not have to be the "raw" link. The link would display as a "blue" link (page exists) not based on whether there is an existing page in the intersection namespace but based on whether all the categories being intersected exist in the category namespace. This means any intersection of existing categories would appear to exist, whether a user has previously "created" the intersection page or not.

Intersection pages will look more or less like category pages. The title of the page would be displayed, possibly followed by manually generated content (added by clicking "edit"), then the first 200 automatically generated links to the subcategories and articles that are members of all the intersected categories (much like a regular Category listing), perhaps followed by a mechanism to expand or further limit the intersection.

The page title will list the categories being intersected in the order specified in the URL used to access the page. Because Category A intersected with Category B is the same as Category B intersected with Category A, intersection pages have a number of built-in synonyms. More about this later.

ユーザにて作成された共通集合カテゴリ (User created category intersection)[編集]

共通集合カテゴリの作成方法はいくつかあります。

  1. 共通集合のURLを入力する。
  2. 検索ボックスに共通集合のページ名を入力し、「表示」をクリックする。
  3. 記事内に共通集合へのリンクを作成する。(討論するときや共通集合のリストを作るときに役立ちます。)
  4. 記事の下部に表示されているカテゴリリストから選択する。
  5. (変形パターンとして)ほかの共通集合が表示されているところから別のカテゴリを選択する。

4番目(と5番目)の項目は新しく強力な機能です。先ほどと同じen:Laurence Fishbourneを例にとって説明します。カテゴリリストは以下のように表示されます。

Categories: 1961 births | Living People | American people | People from Georgia (United States) | People from Augusta, Georgia | People of African descent | Actors | Film actors | Television actors | Best Actor Academy Award nominees | M*A*S*H actors | Miami Vice actors | A Nightmare on Elm Street actors
[選択したカテゴリと一致する記事を表示する]
このリンクの文言は変わるかもしれません。たとえば「選択したすべてのカテゴリのインデックスを作る」など。これはなに?というリンクもできるかもしれません。

This arrangement is very similar to how tags work at Flickr.com. The existing category listing would have a check box added beside each category. Any user would be able to view the result of a category intersection by checking the boxes next to the categories and then clicking on the link to view the intersection set. In this case the three checked boxes would lead to an intersection listing that is functionally very similar to the current Category:African-American actors, but dynamically generated based on an intersection of the selected categories rather than manually populated. Many existing categories could be replaced with intersections, and with this system any intersection is possible, including ones that have been previously discouraged and/or deleted via WP:CFD.

This adds a small amount of category "clutter", but adds the possibility of generating the intersection of any two or more categories. There may be a few more "primary" categories than now exist, but overall there may ultimately be fewer categories listed per article. ALL of the categories appearing at the bottom of an article would be fully populated primary categories and so would be useful as the components of intersections. You would be able to see the intersections even if nobody had explicitly created an intersection page for it, for example what would now have to be Category:African-American film actors from Augusta, Georgia who appeared on M*A*S*H. This creates the effect of having scores of categories without cluttering up articles.

Searches in the Intersection namespace will be done by first sorting the intersected categories into alphabetical order before doing a database query to find an existing intersection. This way any permutation of category order in a URL or link will match the appropriate intersection page.

There will be several ways for users to create category intersections:

  1. By typing the URL of the intersection.
  2. By typing the name of the intersection in the "Search box" and clicking on "Go".
  3. By creating a link to the intersection on a page and then clicking on the link. (This will be useful for discussions and for creating lists of intersection pages.)
  4. By selecting categories listed at the bottom of article pages.
  5. (in some variants) By selecting other categories to intersect from another intersection display

The fourth (and fifth) option would be a new and powerful feature. Using the same Laurence Fishbourne article as an example, instead of the existing category listing the categories might be displayed like this:

Categories: 1961 births | Living People | American people | People from Georgia (United States) | People from Augusta, Georgia | People of African descent | Actors | Film actors | Television actors | Best Actor Academy Award nominees | M*A*S*H actors | Miami Vice actors | A Nightmare on Elm Street actors
[Show articles in all selected categories]
The exact wording of the link might be different, e.g. "Create index using all selected categories". There might also be a link that says "What is this?

This arrangement is very similar to how tags work at Flickr.com. The existing category listing would have a check box added beside each category. Any user would be able to view the result of a category intersection by checking the boxes next to the categories and then clicking on the link to view the intersection set. In this case the three checked boxes would lead to an intersection listing that is functionally very similar to the current Category:African-American actors, but dynamically generated based on an intersection of the selected categories rather than manually populated. Many existing categories could be replaced with intersections, and with this system any intersection is possible, including ones that have been previously discouraged and/or deleted via WP:CFD.

This adds a small amount of category "clutter", but adds the possibility of generating the intersection of any two or more categories. There may be a few more "primary" categories than now exist, but overall there may ultimately be fewer categories listed per article. ALL of the categories appearing at the bottom of an article would be fully populated primary categories and so would be useful as the components of intersections. You would be able to see the intersections even if nobody had explicitly created an intersection page for it, for example what would now have to be Category:African-American film actors from Augusta, Georgia who appeared on M*A*S*H. This creates the effect of having scores of categories without cluttering up articles.

Searches in the Intersection namespace will be done by first sorting the intersected categories into alphabetical order before doing a database query to find an existing intersection. This way any permutation of category order in a URL or link will match the appropriate intersection page.

Options and variations[編集]

There are several variants of this proposal. The aspects that vary relate to:

  • How intersection pages are displayed
  • The interface for navigating around intersection space
  • How intersection space relates to category space
  • The conversion of current categories into intersections.

For each option, mockups and a subpage with further details are provided.

Option: Transclude intersections into categories[編集]

This option closely links intersections with categories. Categories that can be defined as intersections would be depopulated but would still remain in the category structure. Instead of adding articles into the category, the intersection page would be associated with a category page by giving it the category name. Once named, the intersection page would be bound to and automatically transcluded into the associated category. The current categorization structure would not be affected with this option. All currently existing categories would remain, with some being "regular" categories and some being redefined as "intersection" categories.

The basic features of this option:

  • Categories get re-defined as the intersection of fully populated primary categories when appropriate.
  • Articles can be automatically recategorized from intersection categories into the corresponding primary categories. This can happen when a category is first associated with an intersection and then later if any articles are added to the intersection category.
  • The categorization system is protected from vandalism by restricting some maintenance and editing to administrators.

Mockups:


Further details about this option: Wikipedia:Category intersection/Transclude intersections into categories

Option: Named indexes, separate from categories[編集]

This option is modeled after the look of an index that might be found in a book. In this option, the intersection space uses the name "Index" and its pages contain indexes of articles as well as links to more index pages. Like the option above, the "index" pages can be given names. Unlike the option above, the "index" pages are not associated with or transcluded into categories. Categories that can be defined as intersections will be deleted after recategorizing articles into appropriate primary categories. The deleted categories are replaced by indexes which can be categorized or manually linked to category pages. The "index" pages have sets of links to other indexes which are automatically generated by using the subcategories of the intersected categories. This allows users to easily traverse from one index to other related indexes. Pages in the index namespace could be edited (much like categories can be edited), allowing users to annotate the index page with descriptive text, add index pages to categories and add links to other related indexes.

The basic features of this option:

  • "Primary" categories are fully populated (using bots) by recategorizing articles from "intersection categories" into the corresponding primary categories.
  • Categories that can be defined as the intersection of fully populated primary categories will then be deleted.
  • Index pages show the articles that result from the intersection of the primary categories as well as the sub-indexes that are the intersections using subcategories.
  • Index pages will replace "index categories" allowing traversal to numerous "sub-intersections" from the intersection selection table shown on the index intersection page.
  • Some traversal in "intersection space" does not rely on a user created hierarchy.
  • Intersections are given easily understood names, for example "Index:African-American actors" instead of "Index:Actor::American people::People of African descent"

Mockups:


Further details: Wikipedia:Category intersection/Named indexes, separate from categories

Option: Separate intersection space[編集]

In this option the "intersection" namespace would be completely separate from the "category" namespace. Categories that could be defined as intersections would be deleted after recategorizing articles into appropriate primary categories. Every page in the intersection namespace would include an automatically generated intersection selection table, allowing users to easily traverse from one intersection to other related intersections. Pages in the intersection namespace could be edited (much like categories can be edited), allowing users to annotate the intersection page with descriptive text and to add intersection pages to categories.

The basic features of this option:

  • Fully populate (using bots) "primary" categories by recategorizing articles from "intersection categories" into the corresponding primary categories.
  • Delete categories that can be defined as the intersection of fully populated primary categories.
  • Replace "index categories" with index intersections, allowing traversal to numerous "sub-intersections" from the intersection selection table shown on the index intersection page.
  • Traversal in "intersection space" does not rely on pre-created links or categorization of intersections.
  • Intersections have only a functional name including the names of the intersected categories, with no "user friendly" name. This eliminates the need to establish guidelines for these names, or in any way control or manage them.

Mockups:


Further details: Wikipedia:Category intersection/Separate intersection space

Other variations[編集]

Other variations are possible. It is possible to combine, exchange and remove features from the three options above to create other options. We invite participants in this discussion to add any ideas they may have.

Changes to categorization policy[編集]

This proposal, if any of the options are implemented, will have a major effect on categorization policy. Some of these changes can be foreseen, and some will evolve as everyone gets used to the new system. Considerable thought and planning also has to be done before implementing the change.

Once the new system is in place categorization policy will need to be revised. Many aspects of the new system will likely be controversial and it is likely that there will be lively discussion. It is also possible that there will be less controversy than with the current system.

Primary categories[編集]

The main change to policy will be the concept of a Primary category as described above. Primary categories should be tagged as such, so editors will know to fully populate them. Some categories may need to be split because they are both primary categories and navigational categories. A navigational category is a category which contains subdirectories. An example of this is Category:American people by occupation. Navigational categories should not contain any articles. Currently, Category:American people functions as both a primary category and a navigational category. It probably should be split into Category:American people (which would be fully populated with articles about Americans,) and Category:American people by type or something similar (which would have all or most of the subcategories. Category:American people by type would then be a subcategory of Category:American people. This will make it easier to navigate through the subcategories, especially when primary categories are very large and have many subcategories.

Categories as a table of contents: Browsing[編集]

The current guidelines say that categories are primarily meant as a method to browse through articles on a topic. This guideline does not need to change.

Categories as an index: Primary categories[編集]

Since primary categories will be fully populated, they will also function as a complete index of their topic. This feature will no longer be at odds with other functions of categories. The intersection pages will add additional indexing capabilities.

Categories as classification[編集]

Instead of classifying articles by finding the most specific subcategory for the article, they will be classified by finding all the primary categories they belong in. Their classification is in essence the intersection set selecting all their categories. In most cases there will not be any other articles with the same set of primary categories.

Multiple taxonomies[編集]

Many subcategories have been discouraged or deleted because they were not considered important sub-classifications of existing categories. This would no longer be a problem with the new system. Adding attributes to people like sex (Category:Men or Category:Women) or religion (Category:Methodists) should no longer be controversial because sub-categories using these attributes will only be seen if people are looking for them. This will allow multiple taxonomies to coexist.

Currently, certain taxonomies are preferred, such as subcategories by nationality and occupation. This will no longer be the case. No taxonomy will appear to be better than any other. Certainly, taxonomies could still be removed if they are shown to be unencyclopedic. Deleting these taxonomies will only require deleting a single primary category. Once deleted the intersection pages will no longer show any articles. Any links to intersection using the deleted primary category will be red. In option one, any category with an intersection that has a red link should be a candidate for speedy deletion. In all the options, any intersection page that has a red link to a primary category should also be a speedy deletion. This process can probably be automated with a bot.

Categories as a database search[編集]

This system, like the system at flickr.com makes it easy to find articles that are similar in desired ways. From one actor from Ohio, a user will be able to find all actors from Ohio. From one English poet born in 1883, you will be able to find all English poets born in 1883. From one suspension bridge in New York City, you will be able to find all suspension bridges in New York City. This is currently not possible for most searches.

Future related upgrades[編集]

Category viewed as an outline[編集]

Currently there is a clear consensus to not put people in Category:Entertainers and instead, put them into the subcategories of entertainers. It might be useful on occasion to see a complete index of what is in Category:Entertainers, including all the contents of subcategories. A future upgrade might add the ability to view any Category into an outline. Perhaps there would be a link at the top of each category that say "View as an Outline". When the link was clicked, the category view would switch to an outline view. All the subcategories and articles would appear as single alphabetical list. The subcategories would be formatted differently from the articles (perhaps in bold or a larger font). There'd also be another option that said "Show contents of all subcategories" Clicking on this would add the contents of the subcategories to the category or list. If both options are selected the subcategory contents would be indented and listed directly under the subcategory heading. The index view would only go a set number of levels deep and would not show the contents of any categories that are defined as intersections. Perhaps the depth of the index could be a user preference. There might also be a way to "flatten" the outline so that the contents of all the subcategories were combined into a single alphabetical list.

Searching in categories[編集]

The search interface could be extended to include the ability to find articles in specific categories as well.

Comments[編集]

Please respond on the talk page.