1. Trang chủ
  2. » Công Nghệ Thông Tin

Lecture Notes in Computer Science- P11 pot

5 313 0
Tài liệu đã được kiểm tra trùng lặp

Đang tải... (xem toàn văn)

THÔNG TIN TÀI LIỆU

Thông tin cơ bản

Định dạng
Số trang 5
Dung lượng 345,82 KB

Các công cụ chuyển đổi và chỉnh sửa cho tài liệu này

Nội dung

In this scenario, how to use the semantic explicit relations among the resources in the education portal to make better recommendation becomes a new challenge.. In this paper, we propose

Trang 1

many kinds of available relations In this scenario, how to use the semantic explicit relations among the resources in the education portal to make better recommendation becomes a new challenge

In this paper, we propose a complementary study on using the relations between resources and other entities in education portal to do recommendation Therefore, we separate this problem to the following two problems:

1 Formalize the recommendation problem in the education portal;

2 Using relations of the resources and other typed entities to do better recom-mendation in education portal

For an education portal, the recommendation usually happens when a user enters the portal A user can enter the portal by logining in or just using a guest account (anonymous viewer for the web page) In these two scenarios, recommendation is needed to choose the important learning resources in the education portal to be dis-played in the first web page when users enter the portal In this paper, we mainly focus on this kind of recommendation By calculating the important rank of each learning resource, we can decide which resources should be placed in the recommen-dation block of the portal to attract users

Advantages of the proposed approach are as follows:

1 It can make use of the resources’ relations in the education portal The relations can be dynamically added and removed;

2 The approach is easily adapted to recommend other types of entities, for exam-ple, to recommend categories

An example of the resources’ relation diagram is as figure 1:

Fig 1 An example of the learning resources’ relations

Trang 2

The rest of the paper is organized as follows: in section 2, we formalize the rec-ommendation of education portal in this paper In section 3, the proposed important ranking approach is presented in detail In section 4, we analyze the experimental results Finally, after giving the related works in section 5, we discuss the conclusions

in section 6

2 Problem Statements

Every education portal may have its own structure for entities and relations in it This paper proposes a general method to be easily adapted to fit for the specific structure in different education portals In problem statements, we will use the scenario in figure 1 to give example Such example doesn’t affect the generality of the proposed approach First, we give the definition of the entities of an education portal as follows:

Where R indicates resource, which is a core element for recommendation; U indicates user, C indicates category, T indicates tag, and D indicates a department in the

univer-sity In practical case, a resource may be a series of web pages for an online course, or

a web page for some knowledge tips, or even a web page for an announcement of academic forum Category is usually used to indicate which topic the resource is, for example, the resource is about Information Technology Tag is widely used in Web 2.0 in published resources It can be any keywords to help users to understand the resource or to give summarization of the resource Department is a specific concept in education portal, for example, the Department of Computer Science

Except entities, relations are also important in the education portal Here we define the relations as follows:

Where RC indicates the resource-category (belongto) relation; UT indicates user-tag (creates) relation; UR indicates user-resource (creates) relation; TR indicates tag-resource (annotate) relation; DU indicates department-user (contains) relation, CD

indicates category-department (managedby) relation

Thus, the education portal network can be represented by nodes and edges, which can be represented as follows:

E V

Where V is as follows:

}

Where V r indicates all the resources, V u indicates all the users, V t indicates all the tags,

V c indicates all the categories and V d indicates all the departments

E is as follows:

}

Trang 3

Where E rc indicates all the relations of resource-category; E ut indicates all the relations

of user-tag; E ur indicates all the relations of user-resource; E tr indicates all the

rela-tions of tag-resource; E du indicates all the relations of department-user; E cd indicates all the relations of category-department

The network model can be formalized as figure 2

Fig 2 The formalization of education portal network model

We define the transition probabilities λ for different relations As for random walk,

the transition probability can be viewed as the probability of a user jumps to another entity when he/she is visiting an entity in the network According to the transition theory, we have the following formula:

0

1 1 1 1 1

>

⎪⎪

=

=

=

=

= +

mn

cd rc tr du

ur ut

λ λ

λ λ λ

λ λ

(6)

Based on the above analysis, the recommendation in education portal can be for-malized as follows:

Recommend top n important resources in the education portal network by

impor-tance ranking

For other kinds of structures in education portal, the V and E can be different

How-ever, other structures can be easily adapted to our formalization by turning entities into nodes and relations into edges

3 Relation Based Importance Ranking

In this section, we will present our approach in detail Firstly, we will introduce our approach briefly And then, the detail is presented step by step

Trang 4

3.1 Brief Introduction of Our Approach

Our approach is mainly focus on using the relations between entities to calculate the importance of the resources This approach is similar to PageRank [5] in web search The major difference is that in PageRank, there is only one type of entity, which is web page, and one type of relation, which is hyperlink However, in an education portal, there have some semantic explicit relations between different typed entities Therefore, our approach can be divided into following steps:

1) Decide the transition probability In this special case, only λ ut and λ ur needs to

be decided

2) Importance Ranking Calculation The calculation is similar to PageRank in web search except that we have several types of entities and relations 3) Recommendation After the importance ranking calculation, recommendation

is simple The top n ranked resources will be selected to recommend to users

For the first step, a simple method is to give equal values to them Another method

is to try different pairs of values and use experimental results to select a proper one The second step is the core element in our approach and we will mainly describe the second step in the next section The third step is straight forward enough in current scenario

3.2 Detail Introduction of Our Approach

In this section, we will present our approach in detail A random walk [6] based ap-proach is applied to calculate the importance rank of the resources Firstly, we will give the random walk algorithm in this scenario For example, if you are visiting a

user, which is a node of V u in our network, you can have the probability of λ ur to

browse the resources the user creates Furthermore, if the user has created n resources, for each resource the user created, you have λ ur /n probability to visit it

We can formalize this scenario For any two entities in the network i and j, there is

a transition probability t ij , for example, if i is a node represents a user and j is a node represents a resource, the t ij is as follows:

ur r ij

V i count

>

=

) (

1

(7)

Where count(i->V r ) is the number of all the resource nodes the node i has user-resource relation with In the same way, for any two nodes x and y, we can calculate the transition probability of x and y by this method

Therefore, we can get the transition matrix A, whose element at row i and column j represent t ij Reference to PageRank, there is a random jump parameter α Random

jump parameter means that users can randomly jump to any other nodes in the

net-work other than the neighbor nodes We set α=0.15 in this case

T n n E

E

A , ( 1 , , 1 )( 1 , , 1 ) )

1 (

Trang 5

Where n is the number of the entities in the network The importance score can be viewed as a vector S as follows:

T n s s

Where s k represents the important score of entity k, there are n entities totally in the

education portal

Thus, the calculation of S is as follows:

S A

The calculation of S is similar to PageRank by using iterative method In the

fol-lowing section, we will describe the calculation methods in detail

Decide the transition probability In traditional PageRank in web pages, the

transition probability is the same in the link relation because there is only one type of relation, which is hyperlink However, in education portal, there may be different types of relations between entities For example, in our scenario, there are user-resource and user-tag relations in the user entity How to give transition probability to them is a problem Here we simply use the equal values to give transition probability

to them, which means λ ut =λ ur =0.5 In experimental evaluation section, we will

compare the result of different parameters’ values

Calculate the importance rank According to the formula (7) ~ (10), the calculation

of importance rank is as follows:

1) Give the initial importance to every node in the network Here we initialize

every entity’s importance equally, which is 1/n n is the number of all the

en-tities in the network

2) Calculate the transition matrix A′ using the formula (7) and (8)

3) Begin to calculate the importance rank of every entity in the network Until one of the finish conditions is met

The finish conditions are as follows:

1 If the sum of all the entities’ importance ranks’ variation is smaller than a value, which is marked asε

2 If the number of iteration has reached m

4) Output the importance rank of every entity

The process can be summarized as follows

1 Initialization Input all the transition probabili-ties ut, ur, du, tr, rc, cd; Initialize the im-portance vector S=[1/n, … , 1/n]; Initialize the number of iteration as m

2 Calculate the transition probability matrix A’ using formula (7) and (8);

Ngày đăng: 05/07/2014, 09:20

TỪ KHÓA LIÊN QUAN