Distance and similarity measures play an important role in clustering, pattern recognition, decision-making and other scientific fields. Most existing hesitant fuzzy distance measures do not consider the hesitance degree, and those that do consider only the degree of dispersion or only the number of hesitant fuzzy values. To address these shortcomings, a new hesitance degree is defined that accounts for both aspects and therefore has better accuracy and applicability. Several hesitant fuzzy distance measures based on the proposed hesitance degree are then proposed, which overcome some shortcomings of the existing distance measures. Finally, the new hesitant fuzzy distance is applied to the hierarchical hesitant fuzzy K-means clustering algorithm, and an example is given to illustrate the effectiveness of the proposed method.

Introduction

The theory of fuzzy sets proposed by Zadeh [1] has achieved great success in various fields. Since then, scholars have proposed many new theories and approaches for handling uncertainty and imprecision, such as intuitionistic fuzzy sets (IFS) [2], interval-valued intuitionistic fuzzy sets [3], linguistic variables [4], type-2 fuzzy sets [5], fuzzy multisets [6] and picture fuzzy sets (PFS) [7]. With the growing complexity and uncertainty of real-life problems, it is often hard to establish a single degree of membership for a fuzzy set. To address this, Torra [8] introduced the concept of the hesitant fuzzy set (HFS), which permits the membership to take a set of possible values. As an extension of the fuzzy set, the hesitant fuzzy set can better model the hesitation of decision makers who waver between several possible values. Since its introduction, the hesitant fuzzy set has received extensive attention and produced rich research results. For example, Zhang [9] proposed the hesitant fuzzy power average operator, in which the weight of a piece of hesitant fuzzy information depends on the degree of support it receives from the other hesitant fuzzy information. Considering that attributes may be related to each other in realistic decision-making problems, Zhu [10, 11] proposed the hesitant fuzzy Bonferroni mean operator and the hesitant fuzzy Bonferroni geometric operator. Wei [12] considered the priority relationship between attributes and proposed the hesitant fuzzy prioritized operator. Xu et al. [13] introduced a hesitant fuzzy TOPSIS method based on the principle of maximum deviation and applied it to multi-attribute decision-making problems. Liao et al. [14] presented the hesitant fuzzy VIKOR multi-attribute decision-making method, which considers the psychological preference of decision makers. Wang et al. [15] introduced the prospect value function of hesitant fuzzy elements based on prospect theory and a distance measure, and then proposed a multi-attribute decision-making method based on TOPSIS that considers the risk preference of the decision maker. Hesitant fuzzy sets have also been applied in other fields, such as cluster analysis [16–19], decision analysis [20–23] and pattern recognition [24–27].

Distance measures are an important research direction in the theory of hesitant fuzzy sets, and many results on hesitant fuzzy distance have been obtained. For instance, Xu and Xia [28] first proposed a variety of hesitant fuzzy distance measures and discussed their properties. Building on the hesitant fuzzy distance measure of Xu, Tong [29] introduced a hybrid hesitant fuzzy distance measure that considers the preference of decision makers, and Peng [30] presented a generalized hesitant fuzzy synergetic weighted distance measure. Although these measures have many merits, they require the corresponding hesitant fuzzy elements to have the same length. When the lengths differ, values must be added to the shorter element, which inevitably changes its original information and thus distorts what the experts actually expressed. To overcome this shortcoming, Tang et al. [31] proposed a distance measure that does not depend on the length of the hesitant fuzzy elements. However, unless the hesitant fuzzy element has length 1, the distance between two identical hesitant fuzzy elements is not equal to 0, which contradicts a basic requirement of a distance measure. Later, some researchers took the hesitance degree of the hesitant fuzzy element into account in the distance measure. Zhang and Xu [18] proposed the concept of a hesitation index, which is determined by the degree of dispersion of the values in the hesitant fuzzy element, and proposed a series of distance and similarity measures that incorporate the hesitation index of hesitant fuzzy sets. Li et al. [32] proposed a hesitance degree determined by the number of values in the hesitant fuzzy element, and proposed a series of hesitant fuzzy distance measures containing this hesitance degree. However, these hesitance degrees consider only the degree of dispersion or only the number of values in the hesitant fuzzy element, so they are incomplete and discriminate poorly between elements.

Methods

According to the above analysis, the existing hesitant fuzzy distance measures each have shortcomings. To overcome them, we first define a new hesitance degree that considers both the degree of dispersion and the number of hesitant fuzzy values in a hesitant fuzzy element, and then put forward some distance measures based on the proposed hesitance degree. The distance is defined separately for hesitant fuzzy elements of equal and of unequal length, which avoids the distortion of the original information caused by adding elements when the lengths differ. Further, we apply the new hesitant fuzzy distance to hierarchical hesitant fuzzy K-means clustering.

The paper is organized as follows. In the Preliminaries section, some concepts related to hesitant fuzzy sets are introduced. In the Some New hesitant fuzzy distance measures section, a new hesitance degree and some new hesitant fuzzy distance measures are proposed, and their properties are discussed. In the Hesitant fuzzy clustering based on new distance measure section, we apply the new distance measure to the hierarchical hesitant fuzzy K-means clustering algorithm. The final section concludes the paper.

Preliminaries

Definition 1

[8] Given a fixed set X, a hesitant fuzzy set (HFS) on X is defined in terms of a function that, when applied to X, returns a subset of [0, 1].

For convenience, Xia and Xu [33] express an HFS by the mathematical symbol:

$$ E=\left\{< x, h_{E}(x)>\mid x \in X\right\} $$

(1)

where h_{E}(x) is a set of some different values in [0,1], representing the possible membership degrees of the element x∈X to E. For convenience, we call h=h_{E}(x) a hesitant fuzzy element (HFE) and H the set of all HFEs.

For convenience of comparison, we arrange the elements in h_{E}(x_{i}) in increasing order, and let \(h_{E}^{\sigma (j)}(x_{i})\) be the jth value in h_{E}(x_{i}) under this ordering.

Li [32] put forward the axiomatic definition of distance measure for hesitant fuzzy sets (HFSs).

Definition 2

[32]. Let A, B and C be three HFSs on X. Then d is called a hesitant fuzzy distance measure for HFSs if it satisfies the following properties:

(1) 0≤d(A,B)≤1;

(2) d(A,B)=0 if and only if A=B;

(3) d(A,B)=d(B,A);

(4) d(A,B)+d(B,C)≥d(A,C).
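As a rough aid for experimenting with these four properties, the following Python sketch checks them numerically on a finite sample. It is only an illustration: the function name `check_distance_axioms`, the tolerance `tol` and the toy distance in the usage lines are assumptions made here, and passing the check on a sample does not prove that a measure is a distance.

```python
from itertools import product


def check_distance_axioms(d, items, tol=1e-12):
    """Return a list of (property, arguments) pairs that violate Definition 2.

    d     -- candidate distance, called as d(a, b)
    items -- finite sample of objects (for example HFSs) that support ==
    """
    violations = []
    for a, b in product(items, repeat=2):
        dab = d(a, b)
        # Property (1): 0 <= d(A, B) <= 1
        if dab < -tol or dab > 1 + tol:
            violations.append(("range", (a, b)))
        # Property (2): d(A, B) = 0 if and only if A = B
        if (abs(dab) <= tol) != (a == b):
            violations.append(("identity", (a, b)))
        # Property (3): d(A, B) = d(B, A)
        if abs(dab - d(b, a)) > tol:
            violations.append(("symmetry", (a, b)))
    # Property (4): d(A, B) + d(B, C) >= d(A, C)
    for a, b, c in product(items, repeat=3):
        if d(a, b) + d(b, c) < d(a, c) - tol:
            violations.append(("triangle", (a, b, c)))
    return violations


# Toy usage with single membership values instead of full HFSs.
sample = [0.0, 0.25, 1.0]
print(check_distance_axioms(lambda a, b: abs(a - b), sample))
```

An empty list means that no violation was found on the sample; any candidate distance on HFSs can be passed in the same way as long as the sample objects can be compared with `==`.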

Note that the number of values in different HFEs may differ. To compare two HFEs, Xu and Xia [28] extend the shorter one by repeatedly adding the same value until both have the same length. Let l(h_{E}(x_{i})) be the number of values in h_{E}(x_{i}), and \(l_{x_{i}}=\max\{l(h_{A}(x_{i})),l(h_{B}(x_{i}))\}\). Xu and Xia [28] proposed a series of hesitant fuzzy set distances as follows:

Definition 3

[28]. Let A and B be two HFSs on X={x_{1},x_{2},…,x_{n}}. Then, the hesitant normalized Hamming distance is defined as follows:
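For reference, the three distances of Xu and Xia [28] can be written as below. This is a reconstruction from the cited source and from Eqs. (6)–(8), to which these expressions correspond when α=1 and β=0; the subscripts hnh, hne and hng are naming assumptions made here, and the equation numbers of the original are not reproduced.

$$ d_{hnh}(A, B)=\frac{1}{n} \sum_{i=1}^{n}\left[\frac{1}{l_{x_{i}}} \sum_{j=1}^{l_{x_{i}}}\left|h_{A}^{\sigma(j)}\left(x_{i}\right)-h_{B}^{\sigma(j)}\left(x_{i}\right)\right|\right] $$

$$ d_{hne}(A, B)=\left[\frac{1}{n} \sum_{i=1}^{n}\left(\frac{1}{l_{x_{i}}} \sum_{j=1}^{l_{x_{i}}}\left|h_{A}^{\sigma(j)}\left(x_{i}\right)-h_{B}^{\sigma(j)}\left(x_{i}\right)\right|^{2}\right)\right]^{\frac{1}{2}} $$

$$ d_{hng}(A, B)=\left[\frac{1}{n} \sum_{i=1}^{n}\left(\frac{1}{l_{x_{i}}} \sum_{j=1}^{l_{x_{i}}}\left|h_{A}^{\sigma(j)}\left(x_{i}\right)-h_{B}^{\sigma(j)}\left(x_{i}\right)\right|^{\lambda}\right)\right]^{\frac{1}{\lambda}}, \quad \lambda>0 $$

These are the hesitant normalized Hamming, Euclidean and generalized distances, respectively.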

To measure the deviation within each HFE of an HFS, Zhang and Xu [18] proposed the concept of the hesitance degree of an HFS.

Definition 4

[18]. Let H be an HFS in a reference set X, denoted by H={<x,h_{H}(x)>∣x∈X} and \(h_{H}(x_{i})=\left \{h_{H}^{\sigma (j)}(x_{i}) \mid j=1,2, \ldots, l_{h}\right \}\). Then, the hesitance degree of x in H can be defined as follows:

where l_{h} is the number of the elements in h_{H}(x_{i}).

In general, the larger the range of the possible values in an HFE, the larger its hesitance degree. Taking the hesitance degree of HFEs into account, Zhang and Xu [18] proposed a new method for measuring the distance between HFSs:

Definition 5

[18]. Let A and B be two HFSs on X. Then, the hesitant normalized Hamming distance including hesitance degree between A and B is defined as:

$$ {}d_{h z h}(A, B) =\frac{1}{n} \sum_{i=1}^{n}\left(\frac{\alpha}{l_{x_{i}}}\sum_{j=1}^{l_{x_{i}}}\left|h_{A}^{\sigma(j)}\left(x_{i}\right)- h_{B}^{\sigma(j)}\left(x_{i}\right)\right|+\beta\left|h_{Z}\left(h_{A}\left(x_{i}\right)\right)-h_{Z}\left(h_{B}\left(x_{i}\right)\right)\right|\right) $$

(6)

the hesitant normalized Euclidean distance including hesitance degree is defined as:

$$ {}d_{h z e}(A, B) \,=\, \left[\!\frac{1}{n} \sum_{i=1}^{n}\left(\frac{\alpha}{l_{x_{i}}} \sum_{j=1}^{l_{x_{i}}}\left|h_{A}^{\sigma(j)}\left(x_{i}\right)- h_{B}^{\sigma(j)}\left(x_{i}\right)\right|^{2}+\beta\left|h_{Z}\left(h_{A}\left(x_{i}\right)\right)-h_{Z}\left(h_{B}\left(x_{i}\right)\right)\right|^{2}\right)\!\right]^{\frac{1}{2}} $$

(7)

the generalized hesitant normalized distance including hesitance degree is defined as:

$$ {}d_{h z g}(A, B) \,=\,\left[\!\frac{1}{n} \sum_{i=1}^{n}\left(\frac{\alpha}{l_{x_{i}}} \sum_{j=1}^{l_{x_{i}}}\left|h_{A}^{\sigma(j)}\left(x_{i}\right)-h_{B}^{\sigma(j)}\left(x_{i}\right)\right|^{\lambda}+\beta\left|h_{Z}\left(h_{A}\left(x_{i}\right)\right)-h_{Z}\left(h_{B}\left(x_{i}\right)\right)\right|^{\lambda}\right)\!\right]^{\frac{1}{\lambda}} $$

(8)

where λ>0, α,β∈[0,1], α+β=1, \(h_{A}^{\sigma (j)}(x_{i})\) and \(h_{B}^{\sigma (j)}(x_{i})\) are the jth values in h_{A}(x_{i}) and h_{B}(x_{i}), respectively, and h_{Z}(h_{A}(x_{i})) and h_{Z}(h_{B}(x_{i})) denote the hesitance degrees of the two HFEs h_{A}(x_{i}) and h_{B}(x_{i}), respectively.
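To make the structure of Eqs. (6)–(8) concrete, here is a minimal Python sketch of the generalized form in Eq. (8). It assumes that both HFSs are given as lists of HFEs already extended to equal length, and it takes the hesitance degree as a caller-supplied function. The names `d_hzg` and `value_range` are hypothetical, and `value_range` is only a stand-in for a hesitance degree, not formula (5).

```python
def d_hzg(A, B, hesitance, alpha=0.5, beta=0.5, lam=1.0):
    """Generalized hesitant normalized distance including hesitance degree, Eq. (8).

    A, B      -- HFSs given as lists of HFEs (one list of floats per x_i)
    hesitance -- callable returning the hesitance degree of one HFE
    """
    if len(A) != len(B):
        raise ValueError("A and B must be defined on the same set X")
    if lam <= 0 or abs(alpha + beta - 1.0) > 1e-9:
        raise ValueError("need lam > 0 and alpha + beta = 1")
    total = 0.0
    for h_a, h_b in zip(A, B):
        if len(h_a) != len(h_b):
            raise ValueError("extend the shorter HFE before calling")
        a, b = sorted(h_a), sorted(h_b)  # sigma(j): values in increasing order
        value_term = sum(abs(x - y) ** lam for x, y in zip(a, b)) / len(a)
        hesitance_term = abs(hesitance(h_a) - hesitance(h_b)) ** lam
        total += alpha * value_term + beta * hesitance_term
    return (total / len(A)) ** (1.0 / lam)


def value_range(h):
    """Hypothetical stand-in for a hesitance degree; NOT formula (5)."""
    return max(h) - min(h)


A = [[0.3, 0.5]]
B = [[0.5, 0.6]]
print(d_hzg(A, B, value_range))           # lam = 1 gives the Hamming form, Eq. (6)
print(d_hzg(A, B, value_range, lam=2.0))  # lam = 2 gives the Euclidean form, Eq. (7)
```

Passing the hesitance degree as a parameter keeps the sketch independent of any particular definition, so the same function can be reused with whichever hesitance degree is under study.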

Li [32] defined a hesitance degree based on the number of hesitant fuzzy values in hesitant fuzzy elements, and proposed a series of hesitant fuzzy distance measures.

Definition 6

[32]. Let H be an HFS on X={x_{1},x_{2},…,x_{n}}. Then, the hesitance degree of x in H can be defined as follows:

Let M_{1},M_{2},…,M_{m} be a set of HFSs on X={x_{1},x_{2},…,x_{n}}. Then, for any M_{k} and M_{t}, k,t=1,2,…,m, the normalized Hamming distance including hesitance degree between M_{k} and M_{t} is defined as follows:

$$ {}d_{h l h}\left({M_{k},M_{t}}\right)=\frac{1}{2 n} \sum_{i=1}^{n}\left[\left|h_{L}\left(h_{{M_{k}}}\left(x_{i}\right)\right)-h_{L}\!\left(\!h_{{M_{t}}}\!\left(x_{i}\!\right)\right)\right|+\frac{1}{l\left(x_{i}\right)}\! \sum_{j=1}^{l\left(x_{i}\right)} | h_{{M_{k}}}^{\sigma(j)}\!\left(x_{i}\right)-h_{{M_{t}}}^{\sigma(j)}\!\left(x_{i}\right)|\right] $$

(11)

the normalized Euclidean distance including hesitance degree between M_{k} and M_{t} is defined as follows:

where λ≥1, \(l(x_{i})=\max\{l(h_{{M_{k}}}\left (x_{i}\right)),l(h_{{M_{t}}}\left (x_{i}\right))\}\), and \(h_{{M_{k}}}^{\sigma (j)}\left (x_{i}\right)\) and \(h_{{M_{t}}}^{\sigma (j)}\left (x_{i}\right)\) are the jth values in \(h_{{M_{k}}}\left (x_{i}\right)\) and \(h_{{M_{t}}}\left (x_{i}\right)\), respectively.

To relax the requirement that the corresponding hesitant fuzzy elements have the same length, Tang et al. [31] proposed a series of distance measures.

Definition 8

Let A and B be two HFSs on X. Then, the hesitant normalized Hamming distance between A and B is defined as:

$$ d_{lth}(A, B) =\frac{1}{n} \sum\limits_{i=1}^{n}\frac{\sum_{j=1}^{l_{A}\left(x_{i}\right)} \!\sum_{k=1}^{l_{B}\left(x_{i}\right)}|h_{A}^{\sigma(j)}\!\left(x_{i}\right) -h_{B}^{\sigma(k)}\!\left(x_{i}\right)|}{l_{A}\left(x_{i}\right) l_{B}\left(x_{i}\right)} $$

(14)

the normalized Euclidean distance between A and B is defined as follows:

$$ d_{lte}(A, B) =\left[\frac{1}{n} \sum_{i=1}^{n}\frac{\sum_{j=1}^{l_{A}\left(x_{i}\right)} \!\sum_{k=1}^{l_{B}\left(x_{i}\right)}(h_{A}^{\sigma(j)}\!\left(x_{i}\right) \!-h_{B}^{\sigma(k)}\!\left(x_{i}\right))^{2}}{l_{A}\left(x_{i}\right) l_{B}\left(x_{i}\right)}\right]^{\frac{1}{2}} $$

(15)

the normalized generalized distance between A and B is defined as follows:

$$ d_{ltg}(A, B) =\left[\frac{1}{n} \sum_{i=1}^{n}\frac{\sum_{j=1}^{l_{A}\left(x_{i}\right)} \!\sum_{k=1}^{l_{B}\left(x_{i}\right)}|h_{A}^{\sigma(j)}\!\left(x_{i}\right) \!-h_{B}^{\sigma(k)}\!\left(x_{i}\right)|^{\lambda}}{l_{A}\left(x_{i}\right) l_{B}\left(x_{i}\right)}\right]^{\frac{1}{\lambda}} $$

(16)

where λ>0, \(h_{A}^{\sigma (j)}\left (x_{i}\right)\) is the jth value in h_{A}(x_{i}), \(h_{B}^{\sigma (k)}\left (x_{i}\right)\) is the kth value in h_{B}(x_{i}), and l_{A}(x_{i}) and l_{B}(x_{i}) are the lengths of h_{A}(x_{i}) and h_{B}(x_{i}), respectively.
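Eqs. (14)–(16) can be evaluated directly because they average over all pairs of values and need no length extension. The Python sketch below implements the generalized form of Eq. (16); the function name `d_ltg` and the list-of-lists representation of an HFS are assumptions made for illustration. It also reproduces the behaviour noted in the Introduction: two identical HFEs of length greater than 1 receive a nonzero distance.

```python
def d_ltg(A, B, lam=1.0):
    """Normalized generalized distance of Eq. (16) between two HFSs.

    A, B -- HFSs given as lists of HFEs (one list of floats per x_i);
            the HFEs may have different lengths.
    """
    if len(A) != len(B):
        raise ValueError("A and B must be defined on the same set X")
    if lam <= 0:
        raise ValueError("need lam > 0")
    total = 0.0
    for h_a, h_b in zip(A, B):
        # Sum over every pair (j, k) of values, then normalize by l_A * l_B.
        pair_sum = sum(abs(a - b) ** lam for a in h_a for b in h_b)
        total += pair_sum / (len(h_a) * len(h_b))
    return (total / len(A)) ** (1.0 / lam)


same = [[0.3, 0.5]]
print(d_ltg(same, same))        # about 0.1, although both HFSs are identical
print(d_ltg([[0.4]], [[0.4]]))  # 0.0, since each HFE has length 1
```

With lam=1 the function corresponds to Eq. (14) and with lam=2 to Eq. (15).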

Some New hesitant fuzzy distance measures

According to the above analysis, the existing hesitance degrees consider only the number of values or only their degree of dispersion, which is one-sided. Therefore, by considering both simultaneously, we propose a new hesitance degree as follows.

Definition 9

Let A be an HFS in a reference set X = {x_{1},x_{2},…,x_{n}}, denoted by A={<x_{i},h_{A}(x_{i})>∣x_{i}∈X} and \(h_{A}(x_{i})=\left \{h_{A}^{\sigma (j)}(x_{i}) \mid j=1,2, \ldots, l_{h_{A}}\right \}\). Then, the hesitance degree of x in A can be defined as follows:

Here g denotes the minimum accuracy of the hesitant fuzzy element: if n is the number of digits after the decimal point of its values, then g=1/10^{n}. For example, if h={0.2,0.3} is a hesitant fuzzy element, then the minimum accuracy is g=1/10=0.1. If h={0.25,0.36}, then the minimum accuracy is g=1/10^{2}=0.01.
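The rule for the minimum accuracy can be sketched in a few lines of Python. The helper name `minimum_accuracy` is hypothetical, and the sketch assumes that the values are supplied as decimal strings so that the number of written digits is preserved; taking the largest digit count when the values differ in precision is also an assumption made here.

```python
from decimal import Decimal


def minimum_accuracy(values):
    """Return g = 1 / 10**n for an HFE given as decimal strings.

    n is the largest number of digits after the decimal point among the values.
    """
    n = max(max(0, -Decimal(v).as_tuple().exponent) for v in values)
    return Decimal(10) ** -n


print(minimum_accuracy(["0.2", "0.3"]))    # 0.1
print(minimum_accuracy(["0.25", "0.36"]))  # 0.01
```

Strings are used instead of floats because a binary float such as 0.1 does not record how many decimal digits were written.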

Next, we use a numerical example to illustrate the advantages of the proposed hesitance degree in processing hesitant fuzzy information.

Example 1

Let h_{1}={0.3,0.5}, h_{2}={0.5,0.6} and h_{3}={0.3,0.5,0.6} be three hesitant fuzzy elements, and let g=0.1 and θ=μ=0.5. Their hesitance degrees are calculated by the different formulas as follows.

The result calculated by formula (5) is as follows:

The result calculated by formula (17) is as follows:

$$h(h_{1})=0.15, h(h_{2})=0.1, h(h_{3})=0.2 $$

From the above results, we find that h_{Z}(h_{1})=h_{Z}({0.3,0.5})=h_{Z}({0.3,0.5,0.6})=h_{Z}(h_{3}) and h_{L}(h_{1})=h_{L}({0.3,0.5})=h_{L}({0.5,0.6})=h_{L}(h_{2}). The results calculated by formula (5) and formula (9) therefore fail to distinguish these elements, which is unreasonable. In contrast, h(h_{1}), h(h_{2}) and h(h_{3}) are pairwise distinct. That is, the proposed hesitance degree can clearly distinguish the hesitance degrees of the hesitant fuzzy elements h_{1}, h_{2} and h_{3}, which is consistent with intuition. Therefore, the proposed hesitance degree is more reasonable than the existing hesitance degrees mentioned above.

Based on the proposed hesitance degree, we propose some new distance measures that can compare HFEs of equal or unequal length, so that the original information is not destroyed by adding elements when the lengths are unequal.

Definition 10

Let h_{A}(x_{i}) and h_{B}(x_{j}) be two HFEs. Then, the normalized Hamming distance between h_{A}(x_{i}) and h_{B}(x_{j}) is defined as:

where λ>0, α,β∈[0,1],α+β=1, \(l_{h_{A}}\) and \(l_{h_{B}}\) are the lengths of HFEs h_{A}(x_{i}) and h_{B}(x_{j}), respectively.

Especially, if λ=1, then formula (22) degenerates to formula (21). If λ=2, then formula (22) degenerates to formula (20). If λ→∞, then formula (22) degenerates to formula (21).

Example 2

Let h_{1}={0.3,0.5}, h_{2}={0.5,0.6} and h_{3}={0.3,0.5,0.6} be three hesitant fuzzy elements, g=0.1, θ=μ=0.5. Then, the process of calculating the Hamming distance between the HFEs is as follows:

Let {A_{1},A_{2},…,A_{m}} be a set of HFSs on X={x_{1},x_{2},…,x_{n}}, I={1,2,…,m}, k,t∈I. Then, d_{hllh}(A_{k},A_{t}), d_{hlle}(A_{k},A_{t}), and d_{hllg}(A_{k},A_{t}) are hesitant fuzzy distances.

Proof

As d_{hllh}, d_{hlle} and d_{hd} are the special cases of d_{hllg}, here we only prove that d_{hllg} is a distance measure. According to Definition 10, it can be obtained easily that Property (1) and Property (2) in Definition 2 hold. In the following, we prove that Property (3) and Property (4) hold. □

Let h_{1}={0.1,0.5}, h_{2}={0.3,0.8}, h_{3}={0.5,0.6}, h_{4}={0.3,0.5} and h_{5}={0.3,0.5,0.6} be five hesitant fuzzy elements, g=0.1, θ=μ=0.5, α=β=0.5. The distance measures are then calculated with the different formulas, and the results are shown in Table 1.

From Table 1, it can be seen that d_{hllh}(h_{1},h_{1}) = 0 and d_{hllh}(h_{1},h_{2})≠d_{hllh}(h_{1},h_{3})≠d_{hllh}(h_{1},h_{4})≠d_{hllh}(h_{1},h_{5}), which is consistent with intuition. This means that the results based on the proposed distance measure are more reasonable than those of the distance measures mentioned above.

On the other hand, we compare the characteristics of the proposed distance measure with those of the existing distance measures. The results are shown in Table 2.

From Table 2, it can be seen that the proposed distance measure has all listed characteristics, but the mentioned distance measures do not have all of them. This means that the proposed distance measure is superior to the existing distance measures above in many complex situations.

Hesitant fuzzy clustering based on new distance measure

The description of clustering Algorithm

Recently, many studies have focused on the clustering analysis of HFSs. Chen and Xu [35] studied the clustering of hesitant fuzzy sets based on the K-means clustering algorithm, using the result of hierarchical clustering as the initial clusters. Zhang and Xu [36] proposed a novel hesitant fuzzy agglomerative hierarchical clustering algorithm. The algorithm treats each of the given HFSs as a separate cluster and then compares each pair of HFSs using the weighted Hamming distance or the weighted Euclidean distance. The two clusters with the smallest distance are merged, and the process is repeated until the desired number of clusters is reached.

We study the hierarchical hesitant fuzzy K-means clustering algorithm and use the new distance measure to calculate the distance between hesitant fuzzy sets. The specific steps of the algorithm are as follows:

step1. (Hierarchical clustering) Consider each hesitant fuzzy set A_{i}(i=1,2,…,n) as an independent cluster {A_{1}},{A_{2}},…,{A_{n}}. Then calculate the distance between A_{i} and A_{j}, denoted by d_{ij}=d(A_{i},A_{j}). The two clusters with the smallest distance are merged by the average function, which is given as follows:

This iterative process is repeated until all clusters are aggregated into one cluster.

step2. According to the given number of clusters, select the corresponding result of step 1 as the initial clusters, then calculate the distance between each hesitant fuzzy set A_{i}(i=1,2,…,n) and the center of each cluster. Finally, assign A_{i} to the cluster with the closest center.

step3. Recalculate the new cluster center through the average function of the hesitant fuzzy set.

step4. Repeat steps 2 and 3 until all cluster centers are stable.
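The four steps above can be sketched in Python as follows. This is a simplified illustration under stated assumptions, not the authors' implementation: `distance` and `center` are caller-supplied functions standing in for the proposed distance measure and the average function, clusters are compared through their centers, and the names `hierarchical_levels` and `k_means_refine` are hypothetical. The toy data uses plain numbers in place of HFSs.

```python
def hierarchical_levels(items, distance, center):
    """Step 1: merge the two closest clusters until one cluster remains.

    Returns the partition (a list of index lists) obtained at every level,
    from n singleton clusters down to a single cluster.
    """
    clusters = [[i] for i in range(len(items))]
    levels = [[c[:] for c in clusters]]
    while len(clusters) > 1:
        centers = [center([items[i] for i in c]) for c in clusters]
        pairs = [(p, q) for p in range(len(clusters))
                 for q in range(p + 1, len(clusters))]
        # min() keeps the first minimal pair, so ties are broken by order.
        p, q = min(pairs, key=lambda pq: distance(centers[pq[0]], centers[pq[1]]))
        merged = sorted(clusters[p] + clusters[q])
        clusters = [c for k, c in enumerate(clusters) if k not in (p, q)]
        clusters.append(merged)
        levels.append([c[:] for c in clusters])
    return levels


def k_means_refine(items, initial, distance, center, max_iter=100):
    """Steps 2-4: reassign items to the nearest center until nothing changes."""
    clusters = [sorted(c) for c in initial]
    for _ in range(max_iter):
        centers = [center([items[i] for i in c]) for c in clusters]
        new = [[] for _ in centers]
        for i, item in enumerate(items):
            nearest = min(range(len(centers)),
                          key=lambda k: distance(item, centers[k]))
            new[nearest].append(i)
        new = [c for c in new if c]  # drop clusters that became empty
        if new == clusters:          # stable partition, hence stable centers
            break
        clusters = new
    return clusters


# Toy data: plain numbers stand in for HFSs, the mean for the average function.
items = [0.0, 0.1, 0.5, 0.8, 1.0]
dist = lambda a, b: abs(a - b)
mean = lambda xs: sum(xs) / len(xs)

levels = hierarchical_levels(items, dist, mean)
initial = levels[len(items) - 3]  # the level that contains c = 3 clusters
print(sorted(k_means_refine(items, initial, dist, mean)))
```

Note that this sketch resolves ties between equally close pairs silently by iteration order, so a distance measure that rarely produces ties makes the outcome less dependent on such arbitrary choices.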

Illustrative example

A specific example (adapted from Ref. [35]) is given below to illustrate the above algorithm. The proposed hesitant fuzzy distance is applied to the hierarchical hesitant fuzzy K-means clustering algorithm.

Five tourism resources need to be evaluated and classified. Experts give evaluation information (g=0.1, θ=μ=0.5, α=β=0.5) on the tourism resources from six aspects, namely scale, environmental conditions, integrity, service, tourist routes and convenient transportation, which are expressed as X={x_{1},x_{2},…,x_{6}}. The evaluation information of the five tourism resources is represented by the hesitant fuzzy sets A_{i}(i=1,2,3,4,5), which are listed in Table 3:

step1. Consider each hesitant fuzzy set A_{i}(i=1,2,3,4,5) as an independent cluster: {A_{1}},{A_{2}},{A_{3}},{A_{4}} and {A_{5}}. Using formula (21), calculate the distance between each hesitant fuzzy set and the other four hesitant fuzzy sets:

Obviously, {A_{2}} and {A_{3}} are the two closest clusters, so the new cluster {A_{2},A_{3}} is calculated by formula (25). The hesitant fuzzy sets A_{i}(i=1,2,3,4,5) are therefore divided into the following four clusters: {A_{1}},{A_{2},A_{3}},{A_{4}} and {A_{5}}. Next, calculate the distance between each cluster and the other three clusters:

Because {A_{2},A_{3}} and {A_{5}} are the two closest clusters, the hesitant fuzzy sets are divided into the following three clusters: {A_{2},A_{3},A_{5}},{A_{1}} and {A_{4}}. Calculate the new cluster and the distances between each cluster and the other clusters:

Here {A_{1}} and {A_{4}} are the two closest clusters, so the hesitant fuzzy sets are divided into two clusters: {A_{2},A_{3},A_{5}} and {A_{1},A_{4}}.

In the end, the two clusters are merged into one cluster: {A_{1},A_{2},A_{3},A_{4},A_{5}}.

step2. Assume that the number of clusters c=3 is given. According to the result of step1, c_{1}={A_{1}}, c_{2}={A_{2},A_{3},A_{5}} and c_{3}={A_{4}} are selected as the initial clusters. Next, calculate the distance between each hesitant fuzzy set A_{i}(i=1,2,…,5) and each initial cluster c_{j}(j=1,2,3) as follows:

According to the above calculation results, the clustering result is c_{1}={A_{1}}, c_{2}={A_{2},A_{3},A_{5}} and c_{3}={A_{4}}.

step3. The cluster center remains unchanged and the iteration ends.

Comparative analysis

In order to illustrate the performance of the proposed method, we make a comparative analysis with the hierarchical hesitant fuzzy k-means clustering algorithm introduced by Chen et al. [35].

Consider each hesitant fuzzy set A_{i}(i=1,2,3,4,5) as an independent cluster: {A_{1}},{A_{2}},{A_{3}},{A_{4}} and {A_{5}}. Calculate the distance between each hesitant fuzzy set and the other four hesitant fuzzy sets:

We find that d(A_{2},A_{3})=d(A_{3},A_{4})=min{d(A_{i},A_{j})∣i,j=1,2,3,4,5(i≠j)}=0.2222, so there are two options when merging two clusters into a new cluster. Therefore, the following two cases are considered.

case1: The hesitant fuzzy sets A_{i}(i=1,2,3,4,5) are divided into the following four clusters: {A_{1}},{A_{2},A_{3}},{A_{4}} and {A_{5}}. Calculate the distances between each cluster and the other three clusters. Here d({A_{2},A_{3}},A_{5}) is the shortest distance. Merging {A_{2},A_{3}} and {A_{5}} into a new cluster, the hesitant fuzzy sets are divided into three clusters: {A_{2},A_{3},A_{5}},{A_{1}} and {A_{4}}. Calculate the new cluster and the distances between each cluster and the other clusters. Now d(A_{1},A_{4}) is the shortest distance. Therefore, the hesitant fuzzy sets are divided into the following two clusters: {A_{2},A_{3},A_{5}} and {A_{1},A_{4}}. In the end, the two clusters are merged into one cluster: {A_{1},A_{2},A_{3},A_{4},A_{5}}.

case2: The hesitant fuzzy sets A_{i}(i=1,2,3,4,5) are divided into the following four clusters: {A_{1}},{A_{2}},{A_{3},A_{4}} and {A_{5}}. Calculate the distance between each cluster and the other three clusters. Here d(A_{2},A_{5}) is the shortest distance. Merging {A_{2}} and {A_{5}} into a new cluster, the hesitant fuzzy sets are divided into three clusters: {A_{1}},{A_{3},A_{4}} and {A_{2},A_{5}}. Calculate the new cluster and the distances between each cluster and the other clusters. Now d({A_{3},A_{4}},{A_{2},A_{5}}) is the shortest distance. Therefore, the hesitant fuzzy sets are divided into two clusters: {A_{1}} and {A_{2},A_{3},A_{4},A_{5}}. In the end, the two clusters are merged into one cluster: {A_{1},A_{2},A_{3},A_{4},A_{5}}.

Obviously, the clustering results obtained in different cases are different. Next, we analyze the quality of the clustering results of the two cases. Generally, the average distance d_{ρ} is an indicator to measure the quality of clustering results. The smaller the d_{ρ}, the better the clustering result. The calculation process is as follows:

It can be seen that d_{ρ}({A_{2},A_{3}}) is smaller than d_{ρ}({A_{3},A_{4}}). Therefore, the clustering result of case1 is better than case2.

Results and Discussion

According to the above analysis, the comparison result is shown in Table 4.

From Table 4, we can see that Chen’s method introduced in [35] yields two different clustering results. It is difficult to decide which one to choose during the clustering process, and even if the correct one is selected, the selection increases the complexity of the algorithm. In contrast, the proposed method yields a unique clustering result, which is the same as the better of the two results obtained by Chen’s method. Therefore, the hierarchical hesitant fuzzy k-means clustering method based on the proposed distance measure is more reasonable and effective.

Conclusions

Because the existing hesitance degrees do not take into account both the degree of dispersion and the number of hesitant fuzzy values in a hesitant fuzzy element, a new hesitance degree is defined in this paper, which has better accuracy and applicability. We have elaborated the important role of the hesitance degree in hesitant fuzzy distance measures. Further, we proposed some hesitant fuzzy distance measures based on the new hesitance degree, which overcome the shortcomings of the existing distance measures. Moreover, we applied the new hesitant fuzzy distance to the hierarchical hesitant fuzzy k-means clustering algorithm and presented an example to illustrate the effectiveness of the proposed method. In addition, we compared the proposed method with the existing hierarchical hesitant fuzzy k-means clustering algorithm and found that the clustering algorithm based on the new distance measure is more reasonable. The proposed distance measure avoids distortion of the original information and has higher resolution, so it can help decision makers obtain a unique result in practical problems. In the future, we will apply the proposed distance measure to multi-attribute group decision-making, extend the approach to the interval-valued environment, and develop the knowledge measure [37] for hesitant fuzzy sets.

Availability of data and materials

All data generated or analysed during this study are included in this published article.

References

Zadeh LA (1965) Fuzzy sets. Inf Control 8(3):338–353.

Mjka B, Pka B, Wd C, Wk D, Zsa B (2020) Bi-parametric distance and similarity measures of picture fuzzy sets and their applications in medical diagnosis. Egypt Inf J 22(2):201–212.

Zhang Z (2013) Hesitant fuzzy power aggregation operators and their application to multiple attribute group decision making. Inf Sci 234:150–181.

Xu Z, Zhang X (2013) Hesitant fuzzy multi-attribute decision making based on TOPSIS with incomplete weight information. Knowl-Based Syst 52:53–64.

Liu X, Zhu J, Liu S (2014) Similarity measure of hesitant fuzzy sets based on symmetric cross entropy and its application in clustering analysis. Control Decis 29(10):1816–1822.

Zhang X, Xu Z (2015) Novel distance and similarity measures on hesitant fuzzy sets with applications to clustering analysis. J Intell Fuzzy Syst 28(5):2279–2296.

Akram M, Adeel A, Al-Kenani AN, Alcantud JCR (2020) Hesitant fuzzy N-soft ELECTRE-II model: a new framework for decision-making. Neural Comput Appl 3:1–16.

Deli I, Karaaslan F (2020) Generalized trapezoidal hesitant fuzzy numbers and their applications to multi criteria decision-making problems. Soft Comput 25(1):1017–1032.

Karaaslan F, Özlü Ş (2019) Some distance measures for type-2 hesitant fuzzy sets and their applications to multi-criteria group decision making problems. Soft Comput 24(1):9965–9980.

Su Z, Xu Z, Liu H, Liu S (2015) Distance and similarity measures for dual hesitant fuzzy sets and their applications in pattern recognition. J Intell Fuzzy Syst 29(2):731–745.

Zeng W, Li D, Qian Y (2016) Distance and similarity measures between hesitant fuzzy sets and their application in pattern recognition 84:267–271.

Zhang F, Chen S, Li J, Huang W (2018) New distance measures on hesitant fuzzy sets based on the cardinality theory and their application in pattern recognition. Soft Comput 22(4):1237–1245.

Tong X, Yu L (2016) MADM based on distance and correlation coefficient measures with decision-maker preferences under a hesitant fuzzy environment. Soft Comput 20(11):4449–4461.

Peng D, Gao C, Gao Z (2013) Generalized hesitant fuzzy synergetic weighted distance measures and their application to multiple criteria decision-making. Appl Math Modell 37(8):5837–5850.

Tang X, Peng Z, Ding H, Cheng M, Yang S, Li C, de Oliveira JV (2018) Novel distance and similarity measures for hesitant fuzzy sets and their applications to multiple attribute decision making. J Intell Fuzzy Syst 34(6):3903–3916.

Hatzimichailidis AG, Papakostas GA, Kaburlasos VG (2012) A novel distance measure of intuitionistic fuzzy sets and its application to pattern recognition problems. Int J Intell Syst 27:396–409.

The work is supported by the Key Research and Development Project of Hunan Province (No. 2019SK2331), the Natural Science Foundation of Hunan Province (Nos: 2018JJ3213, 2019JJ40100, 2019JJ40099), the Key scientific research projects of Hunan Education Department (Nos: 18A317, 19A202) and the Innovation Foundation for Postgraduate of Hunan Institute of Science and Technology (No. YCX2020A34).

Author information

Authors and Affiliations

School of Information Science And Engineering, Hunan Institute of Science and Technology, Yueyang, Hunan, People’s Republic of China

Fuping Liao was a major contributor in writing the manuscript. Wu Li conducted experimental analysis and comparison. Both of them were the main authors of the manuscript. Xiaoqiang Zhou and Gang Liu corrected the grammar of the paper. All authors read and approved the final manuscript.

The authors declare that they have no competing interests.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Liao, F., Li, W., Zhou, X. et al. Novel distance measures of hesitant fuzzy sets and their applications in clustering analysis. J. Eng. Appl. Sci. 69, 115 (2022). https://doi.org/10.1186/s44147-022-00095-3