<code>duplicated()</code> has a method for <code>data.frame</code>s, which is designed for just this sort of task:

<pre><code>df &lt;- data.frame(a = c(1:4, 1:4),
                 b = c(4:1, 4:1),
                 d = LETTERS[1:8])

df[!duplicated(df[c("a", "b")]), ]
#   a b d
# 1 1 4 A
# 2 2 3 B
# 3 3 2 C
# 4 4 1 D
</code></pre>
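To see why the indexing above works, it can help to look at the logical vector that <code>duplicated()</code> returns before it is negated. This is only a small sketch on the same toy data; the names <code>is_dup</code> and <code>first_rows</code> are made up for illustration.

```r
df <- data.frame(a = c(1:4, 1:4),
                 b = c(4:1, 4:1),
                 d = LETTERS[1:8])

# duplicated() on the key columns returns one logical per row:
# TRUE marks a row whose (a, b) pair already appeared further up.
is_dup <- duplicated(df[c("a", "b")])
is_dup
# [1] FALSE FALSE FALSE FALSE  TRUE  TRUE  TRUE  TRUE

# Negating the mask keeps the first occurrence of each pair,
# together with all other columns (here d).
first_rows <- df[!is_dup, ]
```

For the data in the question, the same pattern would presumably be <code>mydata[!duplicated(mydata[c("sessionid", "qf", "qn")]), ]</code>.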

To address your sorting problem, first read in your example data:

<pre><code>dat &lt;- read.table(text = "                         sessionid      qf       qn            city
1  9cf571c8faa67cad2aa9ff41f3a26e38     cat   biddix          fresno
2  e30f853d4e54604fd62858badb68113a   caleb     amos              NA
3  2ad41134cc285bcc06892fd68a471cd7  daniel  folkers              NA
4  2ad41134cc285bcc06892fd68a471cd7  daniel  folkers              NA
5  63a5e839510a647c1ff3b8aed684c2a5 charles   pierce           flint
6  691df47f2df12f14f000f9a17d1cc40e       j    franz prescott+valley
7  691df47f2df12f14f000f9a17d1cc40e       j    franz prescott+valley
8  b3a1476aa37ae4b799495256324a8d3d  carrie mascorro            brea
9  bd9f1404b313415e7e7b8769376d2705    fred  morales       las+vegas
10 b50a610292803dc302f24ae507ea853a  aurora      lee              NA
11 fb74940e6feb0dc61a1b4d09fcbbcb37  andrew    price       yorkville", sep = "", header = TRUE)
</code></pre>

and then you can use <code>arrange</code> from the plyr package,

<pre><code>library(plyr)
arrange(dat, sessionid, qf, qn)
</code></pre>

or using base functions,

<pre><code>with(dat, dat[order(sessionid, qf, qn), ])
</code></pre>
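Putting the pieces together for the three steps in the question (sort, drop repeated sessionid/qf/qn combinations, histogram of first-name lengths), one possible base-R sketch is shown here. The data frame is a shortened stand-in built from a few of the question's rows, and the names <code>sorted</code>, <code>deduped</code> and <code>name_len</code> are made up for illustration.

```r
# Shortened stand-in for the question's data (a few rows copied from it).
dat <- data.frame(
  sessionid = c("9cf571c8faa67cad2aa9ff41f3a26e38",
                "2ad41134cc285bcc06892fd68a471cd7",
                "2ad41134cc285bcc06892fd68a471cd7",
                "691df47f2df12f14f000f9a17d1cc40e",
                "691df47f2df12f14f000f9a17d1cc40e"),
  qf   = c("cat", "daniel", "daniel", "j", "j"),
  qn   = c("biddix", "folkers", "folkers", "franz", "franz"),
  city = c("fresno", NA, NA, "prescott+valley", "prescott+valley"),
  stringsAsFactors = FALSE
)

# 1. Sort by the column values. Note that order() needs the columns
#    themselves, not their names as strings.
sorted <- dat[order(dat$sessionid, dat$qf, dat$qn), ]

# 2. Keep the first row of each sessionid/qf/qn combination.
deduped <- sorted[!duplicated(sorted[c("sessionid", "qf", "qn")]), ]

# 3. Histogram of first-name lengths. as.character() guards against
#    qf having been read in as a factor, which nchar() does not accept.
name_len <- nchar(as.character(deduped$qf))
hist(name_len, main = "Length of first names", xlab = "Characters in qf")
```

Sorting before deduplicating is not required for <code>duplicated()</code> to work; the order here simply follows the steps listed in the question.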

In your example the repeated rows are identical in every column, so <code>unique</code>, which works with data.frames, is enough:

<pre><code>udf &lt;- unique( my.data.frame )
</code></pre>

As for sorting... joran just posted the answer.
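The difference between <code>unique</code> on whole rows and <code>duplicated</code> on a subset of columns only shows up when the non-key columns differ. The sketch below uses a toy data frame (not the question's data) in which column <code>d</code> is different on every row; the variable names are illustrative.

```r
df <- data.frame(a = c(1:4, 1:4),
                 b = c(4:1, 4:1),
                 d = LETTERS[1:8])

# No row is identical in every column (d differs), so nothing is dropped.
n_whole_rows <- nrow(unique(df))
n_whole_rows
# [1] 8

# unique() on just the key columns removes repeats, but column d is lost.
keys_only <- unique(df[c("a", "b")])
nrow(keys_only)
# [1] 4

# duplicated() on the key columns removes repeats and keeps column d.
with_d <- df[!duplicated(df[c("a", "b")]), ]
nrow(with_d)
# [1] 4
```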

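It works if you use <code>duplicated</code> twice, once from each end, which drops every row whose (c, d) combination occurs more than once: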

<pre><code>&gt; df
  a  b c    d
1 1  2 A 1001
2 2  4 B 1002
3 3  6 B 1002
4 4  8 C 1003
5 5 10 D 1004
6 6 12 D 1004
7 7 13 E 1005
8 8 14 E 1006

&gt; df[!(duplicated(df[c("c","d")]) | duplicated(df[c("c","d")], fromLast = TRUE)), ]
  a  b c    d
1 1  2 A 1001
4 4  8 C 1003
7 7 13 E 1005
8 8 14 E 1006
</code></pre>
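Splitting the one-liner into its two masks may make the logic easier to follow. This is a sketch that rebuilds the data frame printed above; the names <code>key</code>, <code>dup_forward</code>, <code>dup_backward</code> and <code>in_group</code> are made up for illustration.

```r
df <- data.frame(a = 1:8,
                 b = c(2, 4, 6, 8, 10, 12, 13, 14),
                 c = c("A", "B", "B", "C", "D", "D", "E", "E"),
                 d = c(1001, 1002, 1002, 1003, 1004, 1004, 1005, 1006))

key <- df[c("c", "d")]

# Scanning top to bottom flags the later copy of each repeated pair.
dup_forward <- duplicated(key)
dup_forward
# [1] FALSE FALSE  TRUE FALSE FALSE  TRUE FALSE FALSE

# Scanning bottom to top flags the earlier copy instead.
dup_backward <- duplicated(key, fromLast = TRUE)
dup_backward
# [1] FALSE  TRUE FALSE FALSE  TRUE FALSE FALSE FALSE

# A row belongs to a repeated group if either scan flags it.
in_group <- dup_forward | dup_backward
df[!in_group, ]
```

Rows 7 and 8 survive because they share <code>c</code> but differ in <code>d</code>, so their combination is not repeated.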

I want to remove duplicate combinations of sessionid, qf and qn from the following data 

<pre><code>                          sessionid      qf       qn            city
1  9cf571c8faa67cad2aa9ff41f3a26e38     cat   biddix          fresno
2  e30f853d4e54604fd62858badb68113a   caleb     amos
3  2ad41134cc285bcc06892fd68a471cd7  daniel  folkers
4  2ad41134cc285bcc06892fd68a471cd7  daniel  folkers
5  63a5e839510a647c1ff3b8aed684c2a5 charles   pierce           flint
6  691df47f2df12f14f000f9a17d1cc40e       j    franz prescott+valley
7  691df47f2df12f14f000f9a17d1cc40e       j    franz prescott+valley
8  b3a1476aa37ae4b799495256324a8d3d  carrie mascorro            brea
9  bd9f1404b313415e7e7b8769376d2705    fred  morales       las+vegas
10 b50a610292803dc302f24ae507ea853a  aurora      lee
11 fb74940e6feb0dc61a1b4d09fcbbcb37  andrew    price       yorkville
</code></pre>

I read in the data as a data.frame and call it mydata. Here is the code I have so far, but I need to know how to first sort the data.frame correctly, then remove the duplicate combinations of sessionid, qf, and qn, and lastly plot a histogram of the number of characters in the column qf.

<pre><code>sortDATA&lt;-function(name)
{
#sort the code by session Id, first name, then last name
sort1.name &lt;- name[order("sessionid","qf","qn") , ]
#create a vector of length of first names
sname&lt;-nchar(sort1.name$qf)
hist(sname)
}
</code></pre>

thanks!

Remove duplicates column combinations from a dataframe in R
